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Abstract 

Due to the large size of wireless networks, it is often impractical for nodes to track changes in the complete 
network state. As a result, nodes have to make distributed decisions about their transmission and reception 
parameters based on their local view of the network. In this paper, we characterize the impact of distributed 
decisions on the global network performance in terms of achievable sum-rates. We first formalize the concept 
of local view by proposing a protocol abstraction using the concept of local message passing. In the proposed 
protocol, nodes forward information about the network state to other neighboring nodes, thereby allowing network 
state information to trickle to all the nodes. The protocol proceeds in rounds, where all transmitters send a message 
followed by a message by all receivers. The number of rounds then provides a natural metric to quantify the extent 
of local information at each node. 

We next study three network connectivities, Z-channel, a three-user double Z-channel and a reduced-parametrization 
K-usei stacked Z-channel. In each case, we characterize achievable sum-rate with partial message passing leading 
to three main results. First, in many cases, nodes can make distributed decisions with only local information about 
the network and can still achieve the same sum-capacity as can be attained with global information irrespective of 
the actual channel gains. We label such schemes as universally optimal. Second, for the case of three -user double 
Z-channel, we show that universal optimahty is not achievable if the per node information is below a threshold. In 
fact, distributed decisions can lead to unbounded losses compared to full information case for some channel gains. 
Third, using reduced parametrization K-usei channel, we show that very few protocol rounds are needed for the 
case of very weak or very strong interference. However, in other regimes, 0{K) rounds are essential to achieve 
sum-capacity. 



I. Introduction 

Due to mobility, the network connectivity and channel gains in a wireless network are constantly time- 
varying. For nodes to make optimal decisions about their transmission and reception parameters, like rate, 
power, codebooks and decoders, they require full knowledge of the state of the network (defined as the 
network connectivity and the channel gains on each link) to compute the capacity region and thus their 
own operational point. However, in large wireless networks, centralizing complete information about the 
network state implies prohibitive overhead and thus, seldom performed in any practical network. As a 
result, nodes have to make distributed decisions about their transmission and reception parameters based 
on their limited local view of the network state. In this situation, the driving question is can and when 
do distributed decisions lead to globally optimal network operation? 

In this paper, we consider single-hop interference channels [3-5] where the receiver for each transmitter 
has a direct connection to its receiver but otherwise the network connectivity is arbitrary. The network 
state is not known to any node in the network. As a result, none of the nodes know the set of jointly 
achievable rates and the associated capacity-achieving transmission schemes. In contrast, prior work in 
quantifying the network capacity implicitly assumes that each node in the network knows the full network 
state perfectly, e.g. [1-3]. 

To understand network performance with partial information about the network, we need a natural 
metric to quantify extent of network information each node has about the network. Towards that end, we 
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propose a protocol abstraction in the form of a local message passing protocol, where the nodes propagate 
messages related to network state information. The protocol abstraction is inspired by the fact that in a 
network, the only feasible mechanism available for nodes to learn any information is to pass messages 
to their neighbors. In fact, local message passing is the building block in all network protocols, like 
medium access, routing and gossiping. Inspired by belief propagation algorithm commonly used in LDPC 
decoding [6], the proposed message-passing algorithm proceeds in rounds, where one round consists of 
a forward and a reverse phase. In the forward phase, each transmitter sends a message and in the reverse 
phase, each receiver responds with a message. Each message constitutes of only the new information and 
thus is similar to extrinsic information in belief propagation. While there are many parallels between the 
proposed algorithm and belief propagation, we will not explore them further in this paper. 

The message passing protocol exposes the fundamental capacity problem of interest. With each round 
the nodes learn more about the network but they do not have full network information till the message 
passing protocol has terminated. As a result, in the intermediate rounds before the termination of the 
protocol, not only the nodes have incomplete network information, they may have mismatched view of 
the network. Thus, we propose to use the number of protocol rounds as a proxy to quantify extent of 
network information each node has about the network state. 

The characterization of capacity under partial network state information is non-trivial since the exact 
network capacity with full information is still unsolved. Thus, we consider three special cases: Z-channel, 
three-user double Z-channel (two Z channels stacked on top of each other), and a K-user stacked Z- 
channel. In each case, we focus on the sum-rate point on the capacity boundary. Each of the three network 
connectivities has the largest network diameter for a given number of users, and thus the message passing 
requires maximum number of rounds to ensure that all nodes have full network information. For the cases 
when the protocol has not terminated, the resulting network problem is often labeled as hidden node 
problem in network protocol literature [19]. While there is a rich body of literature to design protocols to 
counter the issues related to hidden nodes, the authors are not aware of any information theoretic capacity 
analyses with hidden nodes. 

We seek universally optimal strategies, where each node decides its action based only on its own view 
of the network but the resulting network sum-rate is equal to the sum-capacity with global information at 
all the nodes for all network states. Our results are derived for both deterministic and Gaussian channels, 
and are summarized as follows. 

1) Z-channel: For the Z-channel, the message-passing protocol requires three full rounds to terminate. 
In the case of deterministic Z-channel, we show that a unique universally optimal scheme exists with 
only one and a half rounds of message-passing protocol, even though one of the transmitters does 
not know all the channel gains. The key feature of the scheme is that transmitters are politely as 
greedy as possible but do not hurt transmission of any other flow about which they have knowledge. 
The deterministic case is extended to the case of Gaussian Z-channel, where we show that the sum- 
capacity within 2 bits can be obtained with one and a half rounds of message passing. Note that at 
least two full rounds are required for all nodes to learn the full network in a Z-channel. 

2) Double Z-channel: For the case of deterministic double Z-channel, four rounds of message passing 
are needed. We build on the Z-channel result to propose a distributed rate allocation for double 
Z-channel and characterize the resulting sum-rate after one and a half rounds, and two and a half 
rounds, which are in general different. Our result shows the growth in achievable sum-rate with more 
information about the network. In this case, two and a half rounds suffice to obtain a universally 
optimal scheme. However, the scheme is no longer unique. For the case of one and a half rounds, 
we also show a converse result that there exists no distributed scheme which can be universally 
optimal. In fact, our proposed scheme is optimal for some network states but can have arbitrarily 
large losses in other cases. Thus, this is the first indication that partial information can significantly 
reduce network capacity and that loss in network capacity is unavoidable, i.e, every strategy will 
be suboptimal in certain regime of channel gains. 

In order to prove the above results, the capacity region for the deterministic three user double-Z 
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channel is found in this paper for a general class of deterministic channels which is later specialized 

to the deterministic models. The above results have further been extended to a Gaussian model. For 
the Gaussian double Z-channel, the sum capacity within 4 bits can be achieved with 2.5 rounds of 
message passing. However for 1.5 rounds, sum-rate optimality can only be guaranteed for a subset 
of channel gains. To derive the Gaussian result, we derived novel outer bounds for the sum capacity 
of the Gaussian double Z-channel. Interestingly, we find that treating interference as noise which is 
optimal for weak interference for Z-channel is only optimal for double-Z channel in the cases of 
very weak interference. 

3) K-user Z-Channel: For a general X-user stacked Z-channel, we consider a reduced parametrization 
where all direct links have identical gain and all cross-links are of the same value. Thus, there are 
only three unknown parameters, size of network K, direct link gain and the cross link gain. For both 
deterministic and Gaussian case, we show that one round is sufficient to achieve sum-capacity if the 
ratio a of cross-link gain to direct link gain is less than 1/2 (very weak interference) or greater than 
2 (very strong interference). In the first case {a < 1/2), flows treat interference as noise and thus 
learning about other parts of the network is not useful. In the second case (a > 2), interference can 
be completely cancelled out and thus each node can be greedy without requiring any information 
from other nodes. For a G (1/2, 2/3], no more than two rounds are required to achieve optimality. 
And for all other values of a e (2/3,2), 0{K) rounds are required to achieve optimaUty. This 
suggests that network could measure itself adaptively by using more rounds only if certain channel 
gains are detected in the first round. For the fC-user case, extension to Gaussian case is provided for 
a subset of a values. We find that for very weak interference and very strong interference, one round 
of message passing suffices to achieve the sum capacity for symmetric Gaussian stacked Z-channel. 
We make two salient observations about the message passing protocol and its relation to other metrics. 
First, after d full rounds of message passing, each transmitter knows all channels which are 2d hops away 
and each receiver knows links up io 2d — \ hops away. After d.h rounds, receiver information increases 
to 2(i -f- 1 hops. Thus, number of protocol rounds directly relate to a common method to specify local 
information in network protocol analyses. Second, the messages can be easily related to practical network 
operation. For example, 1.5 rounds is a common choice in cellular systems, translating to beacons from the 
base- stations, feedback by the mobile units followed by last half round of a message from the base-station 
indicate rate and power decisions. 

The rest of the paper is organized as follows. In Section n, we describe the channel models. In Section 
III, we give a general message passing protocol for a K user interference channel. In Section IV, we 
characterize the sum-rate with partial information at the nodes for a deterministic and Gaussian Z-channel. 
In Section V, we find the capacity region for a deterministic double Z interference channel and outer 
bounds for Gaussian deterministic channel. The capacity region is found for a general class of deterministic 
channels on the lines of [12] which is then specialized to a specific deterministic channel model considered 
in the paper. We also derive results for 1.5 and 2.5 rounds of message passing for both deterministic and 
Gaussian channels. In Section VI, we consider a symmetric K-user one-sided channel with unknown K 
and find the increase in the sum capacity for varying rounds of message passing algorithm. Section VII 
concludes the paper. 

II. Problem Formulation 

A. Channel Models 

We will consider two models for interference channels with K transmitters {Ti}f^^ and K receivers 
{^i}f=i- ^ deterministic model [3] and the additive noise Gaussian model as described follows. We assume 
that both transmitters and receivers can transmit messages, that is the network is bi-directional in nature. 

1 ) Deterministic Model: In a deterministic interference channel, the inputs of k^^ transmitter at time i 
are denoted by Xki = [Xki^ Xki^ ■ ■ ■ ^fcij e {0,1^, k = 1,2, ■ ■ ■ , K, such that Xki^ and Xki^ are the 
most and the least significant bits, respectively. The received signal of user j, j — 1,2, - ■ ■ , K, at time i 
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is denoted by the vector Yji — [Yji^ Yji^ . . . Yji^] e {0, 1}*. Specifically, the received signal Yji, of an 
interference channel is given by 

K 

^ji = 0S''-"'=^Xfe, (1) 
k=l 

where © denotes the XOR operation, and S^"'**^ is a q x q shift matrix with entries Sm,n that are non- 
zero only for {m,n) = {q — Ukj + n,n),n — 1,2, .. . ,njk. We will also use XJ!, Y^, etc. to denote 
(Xfei, • • • , Xkn), (Yfci, ■ ■ ■ , Yfcn), etc. Associated with each transmitter k and receiver j is a non-negative 
integer rikj that defines the number of bit levels of observed at receiver j. The maximum number of 
bits supported by any link is g = maxfcj(nfcj). The network can be represented by a square matrix H 
whose (i, j)*'*^ entry is i/jj = riiy We note that B. need not be symmetric. 

2) Gaussian Model: In a Gaussian interference channel, the inputs of k^^ transmitter at time i are 
denoted by Xki G C, k — 1,2, ■ ■ ■ , K, and the outputs at j*"^ receiver in time i can be written as Yji e C, 
j = 1, 2, • • • , /T. The received signal Yji, j = 1, 2, • • • ,K is given by 

K 

Yji — ^2 ^kj^ki + Zji, (2) 

fe=i 

where hkj e R"*" is the channel gains associated with each transmitter k and receiver j, and Zji are 
additive white complex Gaussian random variables of unit variance. We will also use X]^, Y^, etc. to 
denote (X^i, • • • (lli,-- - etc. Further, the input X^i has an average power constraint of 

unity, i.e. E(|Xjti|^) < 1 (where E denotes the expectation of the random variable). Only in Section III, 
we will make an exception where no power constraint will be imposed on messages sent by transmitters 
or receivers. 

Like the deterministic case, we represent the network state by a square matrix H whose {i.jf^ entry 
is Hij = Thus we will use the matrix H for both the deterministic and the Gaussian model, where 

the usage will be clear from the context. 

B. Per Node Local View 

Our objective is to understand the impact of nodes' decisions on network sum-rate, when the decisions 
are based on their partial information about the matrix H. For transmitters we will denote this partial 
information about the network as and as A^^ for the receivers. If the nodes know nothing about the 
network matrix H (i.e no information about its size or entries), then Nk = Nj^ = ^ (empty set), which 
is equal to assuming that there is no other node in the network. On the other hand, if the nodes know 
everything about the network, then — — H and is also the most commonly assumed scenario in 
most information-theoretic analyses [4, 15, 18]. 

We now define network state and network connectivity. We assume that that there is a direct link 
between every transmitter Tj and its intended receiver Dj. On the other hand, if a cross-link between 
transmitter i and receiver j does not exist, then Hij = 0. Given a network, its connectivity is a set of 
edges E = {(Tj, Dj)} such that a link Tj — is not identically zero. Then the set of network states, Q, 
is the set of all weighted graphs defined on E. For the deterministic model, the set of network states can 
be written as 

GiE) = {H : Hij = if {Ji, Dj) ^ E else Hij e {0, 1, . . . , q}} , 
and in the Gaussian model as 

g{E) = {H : Hij = if (T^, Dj) ^ E else Hij > 0} . 
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Note that the channel gain can be zero but not guaranteed^ to be if the node pair (Tj, D^) e E. 

Our main focus is the case where N^, for each k is only a subset of the whole matrix. Thus, each 
node knows the network matrix partially. In fact, it is quite possible that nodes know only a few entries 
of the matrix and do not know the size of the whole matrix H, i.e network size. As we will see later, 
this partial knowledge of the network matrix at each node leads to the case where each node's knowledge 
about the network is mismatched from other nodes in the network. That is, each node possibly knows a 
different part of the whole matrix H. We will study the achievable sum-rate as the network information at 
each node grows from no-information to full information. In Section III, we will define a special trajectory 
of sequence of growing network information which is directly connected to protocols in practical systems 
and is also related to commonly used metric of 'number of hops' to denote amount of side information at 
each node. To aid analysis, we will assume that all nodes are provided some side information, SI, about 
the network state before the onset of the protocol. Thus, nodes may have non-zero information about the 
network before even a single message is sent. We next define sum-capacity. 

C. Sum Capacity 

First consider the X-user deterministic interference channel. For each user k, let message index be 
uniformly distributed over {1,2, ...,2"^*}. The message is encoded as using the encoding functions 
ek{mk\Nk,S\) : {1, 2, . . . , 2"^'=} i-^ {0,1}"'^, which depend on the local view, A^^, and side infor- 
mation about the network, SI. The message is decoded at the receiver using the decoding function 
4(^fc"|-/Vfc,SI) : {0,1}"'' ^ {1,2, ...,2"^'=}, where iV^ is the receiver local view and SI is the side 
information. The corresponding probability of decoding error \k{n) is defined as Pr[mk ^ A^(., SI)]. 

A rate tuple /?2, ■ ■ ■ ; Rk) is said to be achievable if there exists a sequence of codes such that the 
error probabilities Ai(n), • ■ ■ Xxin) go to zero as n goes to infinity. The closure of the set of achievable 
rate tuples is defined as the capacity region C. 

Now, consider the K-user Gaussian interference channel. For each user k, we again assume that the 
message index is uniformly distributed over {1, 2, 2"^*=}. Further, we use the same notation for 
encoding and decoding functions. Thus, encoding functions are ek{mk\Nk,S\) : {1,2, ...,2"^*} — C" and 
decoding functions are gk{Y^'\N',^,S\) : C" — > {1,2, ...,2"^'=}. The corresponding probability of decoding 
error Afc(n) defined as Pr[mfc 7^ gk{Y^\Nj^, SI)]. A rate tuple i?2, • • • , Rk) is said to be achievable 
if there exists a sequence of codes such that the error probabilities Ai(n), • ■ ■ Xxin) go to zero as n goes 
to infinity. Again, the closure of the set of achievable rate tuples is defined capacity region C. 

The sum capacity in both cases is defined as 



We note that all encoding and decoding functions depend only the local and side information at the 
transmitters and receivers about the network. In this case, the nodes have to operate with the local 
knowledge N^, Nj^ and the side information SI so that the probability of error at the receivers go to zero 
as n goes to infinity for all G ^(E), leading to a compound channel capacity formulation. In this paper, 
optimal sum capacity refers to the sum capacity with the full state information at all the nodes. 

When the local information about the network is mismatched, the nodes can take actions which can 
work against each other and in the process reduce the sum-rate of the network. This issue of making 
distributed decisions about rate, power, codebooks and decoder is fundamental in most networks and is 
the main topic of study in this paper. 

'The model is inspired by fading cliannels, wliere the existence of a link is based on its average channel gain. On the average the link 
gain may be above noise floor but its instantaneous value can be below noise floor. 




(3) 
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III. Learning Network State Information 

In this section, we describe a protocol which uses local message passing to propagate network state 
information to the nodes in the network. The protocol is described in terms of entries of matrix H = [Hij] 
and thus applies to both deterministic and Gaussian models. Our motivation is to find the most relevant 
cases of local view {Nk^Nj^} we should consider. 

A. Why Learn the Network ? 

Before we dive into the details of learning the network state information, it is important to understand 
why we might need to even estimate and propagate network state information. One could adopt a "non- 
coherent" approach, where no resources are wasted in estimating any channel or network connectivity, 
and nodes code such that reliable communication is possible without any network state information, i.e, 
Nk — N'l^ — ^. However, in compound capacity formulation, the capacity region with no information 
(local or side) about the network is a singleton, where the only possible rate tuple is all zero-rate tuple. 
This follows directly from point to point Gaussian channel where the compound capacity is zero if the 
link state is unknown to the transmitter since in worst case the link gain can be zero. Thus, to achieve a 
non-zero rate, the network information at nodes should be non-trivial. 

The obvious next question is what cases of network information {Nk, A^(.}fc should one consider. For 
a K-usQV network, the matrix has entries, which implies that the per node information can be any 
of 2^^ cases. Thus, there are 2^^'^ possible combination of side information cases. This large number of 
cases quickly becomes intractable. However, we contend that most of these side information cases are not 
of practical interest. 

A common metric to capture extent of network view at each node is number of hops (e.g see [20, 
and references therein]). That is equivalent to each node k knowing a sub-matrix of H. The metric is 
motivated by message-passing algorithms which broadcast and forward information about local state. 
A clear advantage of this metric that it greatly reduces the number of local information sub-cases one 
needs to consider and there is a direct relation with actual protocols which gather this side information. 
We propose to adopt a related metric which is equally concise and tightly related to protocols in many 
operational networks. 

B. Message Passing Protocol 

For nodes to learn and propagate the network state, they have to communicate with each other. This 
inter-node communication is possible only with nodes to which there is a direct link, i.e, messages have to 
be exchanged locally and those messages are then processed and propagated to other nodes. This obvious 
construct of local message passing is central to all multi-hop network protocols. In our development, the 
only practical reality we will be concerned with is that direct communication is possible only between 
neighbors and its impact on amount of network state information at each node. Hence, we will simplify 
some of the implementation complexities as follows. 

The proposed message passing protocol proceeds in rounds, where each full round has two phases: a 
forward phase where all transmitters broadcast a message and a reverse phase where all receivers broadcast 
a message each. We assume that all messages are scheduled so that there are no "collisions" at any of the 
nodes in receiving mode due to simultaneous transmissions. Finally, the broadcast messages can only be 
heard by nodes to which the sending nodes has direct links (the links that are in the network connectivity 
E), thus no extra feedback or Genie channels are available. 

The message broadcasted by the transmitter k in round t (transmitters are data sources) is labeled mk,t, 
which is received by all the receivers j who have direct links to transmitter k. Analogously, the message 
broadcasted by the receiver k at round t is labeled which is received by all the transmitters j who 
have a direct Unk to receiver k. 

Recall that each node's information about the network is represented by either (for transmitters) or N'^ 
(for receivers). Instead of assuming that the nodes have no information to start with (i.e N^ — N'^ — $), we 
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will consider special sub-cases where all nodes have some minimal side information SI about the network. 

The assumption of side information is largely for analytical simplification. This will only change the 
contents of messages and not the message-computation and passing rules. For the following description, 
we will assume that at time t = 0, each node knows the size of the network or SI = {K}. Thus, the 
nodes know the size of the matrix H but do not know any of its entries. An alternate case will also be 
considered where the size of network, K, will be assumed unknown at the onset of the protocol but nodes 
will have a priori knowledge of the form of the matrix H. 

The message passing protocol with knowledge of side information SI = {K} is described below. 

1) Round 1 (Forward): Since none of the entries in the matrix H are known, the first message from 
each transmitter is a known training signal along with the transmitter identity. Thus i = {ipk, T^}, 
where ipk is the training signal from transmitter k. At the end of the transmitter messages, receiver j 
knows the non-zero elements of column j of matrix H learnt via channel estimation (however may 
not know the value of j). 

Round 1 (Reverse): The receiver k broadcasts M^i = UieEj, {{Hi^k:~^i: D^)}, where Ej^ is the set 
of vertices connected to receiver k. Transmitter can receive i if it has a direct link to receiver 
k. This completes the first round. 

2) Round t > 1: In round t > 1, nodes only forward new information which is computed as follows. 
In the forward phase for transmitters, the broadcast message is 



where is the set of vertices connected to transmitter k. The message mk,t is a concatenated version 
of its received messages from previous round minus the messages it has broadcasted in previous 
transmissions and those that are already known to all of its neighbors. 
In response, the receivers broadcasts following in the reverse phase 



The message j is the concatenation of its received message minus its previously broadcasts 
messages and after removing what is known to all its neighboring transmitters. The messages ^ 
and Mk,t are similar to the extrinsic information in belief propagation with the main difference being 
that the messages are broadcasts. 
3) Stopping Rule: If a transmitter or receiver has no new updates, it sends a NULL message in 
its assigned time-slot. Thus, nodes only forward information when new information is received and 
send "nothing" otherwise. When all the neighbors of a node send a NULL message, each node 
stops sending any new messages (even NULL messages). 
The above protocol is similar to message passing for belief propagation on factor graphs, often used 
in LDPC decoding [6]. Belief propagation is closer to gossiping in networks [7], where a node can talk 
only one node at any time. Our proposed approach exploits the broadcast nature of wireless, and hence 
is closer to broadcasting in networks [8]. While connections between broadcast-based generalization of 
belief propagation [9] and proposed message passing are of interest, they are beyond the scope of this 
paper. 

Before we proceed, we quickly note our use of word "message." In this paper, we use messages for 
transmissions which may or may not depend on information bits. In contrast, the usual information- 
theoretic parlance, message often only refers to the "raw" information bits (the information-bearing 
message) sent from sender to the receiver [21]. In our case, the messages do carry information but 
about the network state. Thus, they are similar to control messages, like training, feedback, ARQ etc, in 
networks. 




(4) 




(5) 
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Direction 



Forward 



ODi 




Reverse 



ODi 



OD2 



T2O* 



Round 1 



Di learns {Hn} 
D2 learns {H12, H22} 




Round 2 



mi,2 = {(H,E)} 

7712,2 = NULL 

Di learns whole H 

D2 learns whole H 



Ml 2 = NULL 

Ma.'s = {(Hn,Ti,Di)} 



T2 now knows whole H 



Round 3 



mi_3 = NULL 
77l2,3 = NULL 



Ml 3 = NULL 
M2^3 = NULL 
Message-passing terminates 



Fig. L Full three rounds of message-passing protocol in Z-channel. 



C. Notes on Message Passing Protocol 

It is instructive to consider an example to understand the properties of the message passing protocol; 
Figure 1 shows the steps for the Z-channel. Some key facts are as follows: 

1) Form of Messages: Each non-null message is an unordered set of channel coefficients along with 
the location in the matrix indicated by the transmitter-receiver identity. This ensures that the set 
intersections and union in (4) and (5) are well defined. 

2) Guaranteed Termination in K + 1 rounds: Since the channel estimation is assumed to be perfect and 
messages face no losses, the message passing protocol is guaranteed to converge m K + 1 rounds. 
However, the actual number of rounds depends on the graph diameter. If the interference network is 
fully connected, then message passing terminates after 2 rounds. However, for a stacked Z-network 
(see Section VI), full K + \ rounds are required. 

3) Impact of Side Information: The messages can be reduced in length if any side information about 
the network is known. In the preceding discussion, we assumed that the network size is known and 
hence the size of the matrix is assumed known. As a result, we do not need to estimate the network 
size. 

In Section VI, we will consider a class of i^-user interference channel which is of the form of 
stacked Z-channels. For the i^-user channel, we will assume that the network is parameterized 
by only three parameters K, Hu and H12 where direct channel gains are Hn and H12 are all 
cross channel gains in the network. All the users know that the network is parameterized by three 
parameters but do not know the value of these parameters or their placement in the network. Thus, 
SI = {stacked symmetric Z-channel connectivity} is the side knowledge about network connectivity 
(will be described more precisely in Section VI). Hence, we assume that these are the only three 
parameters that the nodes need to learn besides knowing their relative position in the network (how 
many users above/below a particular user). In the first round (forward), all the receivers except the 
first would know Hu and Hu and hence they can directly tell the transmitters Hn and H12. Thus, 
the only other information which will not be available after the first round is the relative location 
in the network which depends on the total number of nodes K. Let the set of node identities of the 
transmitters that the receiver Dj is connected to be Tj. The message passing protocol simphfies as 
follows. 

Round 1 (Forward): Since none of the entries in the matrix H are known, the first message from 
each transmitter is a known training signal along with the transmitter identity. Thus 1 = {^k, T^}, 
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where z/'fe is the training signal from transmitter k and its node identity. 
Round 1 (Reverse): The receiver 1 broadcasts Mi i = {Hii}{_] while the receiver k >1 
broadcasts M^ i = {{Hn, Hi2)}\J M',^^, where M'-^ = {(Dj,7^)}. Transmitter j can receive Mfe,i 
if it has a direct link to receiver k. 

All the further rounds proceed as before replacing M^^i with M'^ ^, except that mi,2 also includes 
broadcasting ifi2 so that Di knows the cross channel gain. This is possible since Node 1 will know 
that its position in the network after first round based on its side information SI. 

4) Full and Partial Network Information: The case of full information is equivalent to the case of 
message passing operating till termination. With fewer than maximum number of rounds, at least 
one of the nodes may not have full network information. For example, in Figure 1, less than 2 
rounds imply that there is some node (e.g. Ti) which does not have full information. 

5) Mismatched Local Views: The protocol naturally exposes one of the key issues in networks that 
different nodes will have different information about the network state if the protocol is not carried 
to its completion. For large networks, taking the protocol to completion would imply a large number 
of rounds, and is thus impractical. For example, after 1.5 rounds (first full round + only forward 
phase in second round) in Z-channel, T2 does not know Ti ^ Di channel, while other nodes know 
the whole network. We will thus use number of rounds as a proxy for amount of local information, 
with each extra round providing increased information about the network. 

6) Relation to Hop Length: By observing the time-line in message passing in Figure 1, it is straight- 
forward to conclude that different nodes learn about the whole network at different times. Each full 
round allows transmitters to learn two extra hops of information and receivers one extra hop. In d 
rounds, a transmitter will know all routes which are 2d-hop long rooted at that transmiter, and a 
receiver knows routes of length 2d — 1. That is why, in Figure 1, T2 needs 2 rounds to learn about 
Hii, since it is three hops away. 

7) Relation to Practical Protocols: The messages can be translated into practical network operation. 
First round is training, like physical layer preamble in most networks. And rest of the rounds can 
be understood as channel feedback, like often studied in [10]. In practice, networks operate with 
\Qxy few rounds of message passing. For cellular networks, 1.5 rounds is a common choice, roughly 
translating to beacons from base-stations, feedback from mobiles and a rate allocation decision from 
the base-station. Thus, the case of 1.5 rounds will be of special interest to us. 

D. Optimal Strategies 

We now formally define the concept of universally optimal strategy with partial information. Suppose 
that the transmitter i knows local information Ni, the receiver i knows local information and all the 
nodes know side information SI. 

Definition 1 (Universal OptimaUty). A universally optimal strategy with partial information at nodes 
(Ni at transmitter i, N- at receiver i and SI at all the nodes) is defined as the strategy that each of the 
transmitter i uses based on its local information N^ and side information SI, such that following holds. The 
strategy yields a sequence of codes having rates Ri at the transmitter i such that the error probabilities 
at the receiver, Xi{n), • • • Xxin), go to zero as n goes to infinity, satisfying 



for all the sets of network states consistent with the side information. Here Cgum is the sum-capacity of 
the whole network with the full information. 

Definition 2 (Approximate Universal Optimality). An approximate universally optimal strategy with partial 
information at nodes (Ni at transmitter i, N- at receiver i and SI at all the nodes) is defined as the strategy 
that each of the transmitter i uses based on its local information N^ and side information SI, such that 




sum 
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following holds. There exist a sequence of codes having rates Ri at the transmitter i such that the error 
probabilities at the receivers, Xi{n), ■ • ■ Xxin), go to zero as n goes to infinity, satisfying 

i 

for all the sets of network states consistent with the side information. Here Cgum is the sum-capacity of 
the whole network with the full information and r is a fixed constant independent of the channel gains. 

Thus, an (approximate) universally optimal strategy is one where for all network states consistent with 
the side information, decisions based on local information and the side information lead to (approximately) 
globally optimal solutions. We will use the notion of universal optimality for the deterministic model. 
Since we do not know the exact capacity region for the Gaussian interference channel, the notion of 
universal optimality will be replaced by approximate universal optimality. 

In this paper, we will assume that the local information at the nodes is obtained by a message passing 
protocol when run for d or d.b rounds for d > 0. In the case of d.b rounds, the last 0.5 round of 
message passing represents the message from the transmitters to the receivers but no message in the 
reverse direction. This is to ensure that the receivers know more than the transmitters so that reliable 
decoding can take place. 



IV. Two User Z-channel 

The smallest possible network of interest is a two-user interference channel. There are three network 
connectivities in this case, a fully connected bi-partite graph (interference channel), a Z-channel (one cross- 
link is missing) and two decoupled flows (two point-to-point links). We assume that all the nodes know 
that there are a total of two nodes in the network, or SI = {K = 2}. From the message passing protocol 
in Figure 1, it is clear that 1.5 rounds are sufficient for every node in the network to learn the whole of 
the state information for both fully connected bipartite graph and two decoupled flow connectivities. So 
the strategies decided with local view result in globally optimal strategy decisions. Hence we will focus 
our interest on the Z-channel, where 1.5 rounds do not result in full information at all the nodes; T2 does 
not know Hn after 1.5 rounds. 



A. Deterministic Z-Channel 

In a deterministic Z-channel, K — 2 and 7121 = 0. Specifically, the received signal Yji, j — 1,2, of a 
Z-channel is given by 

Yu = S^-'^^Xh (6a) 
Y2i = S™Xu®S"'X2i (6b) 

The network state message passing is described in Figure 1. After two full rounds, every node in the 
network knows the complete network state, i.e. the matrix H is known completely to all four nodes. In 
this case, the achievable capacity region is also known exactly [5, 11, 12]. 

Theorem 1 ([5, 11, 12]). The deterministic channel capacity region for a two-user Z-channel is the set of 
nonnegative rates (i?i,i?2) satisfying 

Ri < nil 
R2 < n22 

R1 + R2 < max(n22,ni2,nii,nii + n22-ni2) (7a) 

Since our main interest is in the case of partial information, we ask if fewer than two full rounds 
suffice to achieve sum capacity. The following theorem proves that 1.5 rounds are sufficient to achieve 
sum capacity for all two-user H, i.e. a universally optimal rate allocation exists for all H. Note that we 
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assume that the network size of = 2 is known to all the nodes. Further, in 1.5 rounds of message 
passing all nodes know that ?t,2i = 0. Thus, all nodes know that the connectivity is that of a Z-channel. 

Theorem 2. The sum capacity for a Z-channel can be achieved without completing the full message 
passing algorithm. To be precise, only the first 1.5 rounds are required to achieve the full-knowledge 
sum-capacity with the side information that the network size is K = 2. 

Proof: With the side information and 1.5 rounds of message passing, all the nodes know if they 
are the top user or the bottom user of the Z-channel. In Appendix A, we show that each transmitter can 
use only their local view to decide their rate and codebooks such that full-knowledge sum-capacity is 
achievable. ■ 

Corollary 1. There exist a universally optimal strategy with the local information iVj and N- provided 
by 1.5 rounds of message passing and the side information that K = 2. 

Proof: With 1.5 rounds of message passing and side information K = 2, all the nodes know the 
network connectivity. For all the connectivity choices, a strategy can be found by the nodes based on 
the channel gains they know that would achieve the full-information sum-capacity. For example, if the 
network is fully-connected, the nodes can use the node indices to order themselves since the labels Tj are 
unique and thus the nodes can compute an optimal strategy. If the network is a Z-channel, the strategy 
of Z-channel described in the proof of Theorem 2 can be used. ■ 

Thus, there is no loss in the performance even if the second transmitter T2 does not know about the 
whole network state and schemes with partial information exist which are universally optimal for all 
Z-channel H G Q{Ez), where Ez is the is the Z-channel connectivity. It is very instructive to closely 
study the structure of the rate allocation scheme. 

Since transmitter J 2 does not know the direct channel Hn, it in fact chooses to ignore the presence of 
the other transmitter. As a result, J 2 acts in a greedy fashion and sends at full rate of H22 bits. On the 
other hand, Ti knows the whole network and that it interferes with D2. It can then choose a transmission 
scheme which sends information below the noise floor of T2's transmission and if possible, above T2's 
signal. Figure 2(a)-(b) depicts the allocations for the case of deterministic channels. 




(a) (b) (c) 

Fig. 2. (a) and (h): Allocation for two possible deterministic Z-channels, and (c) corner point achieved by the rate allocation with partial 
information. [Note: solid red lines mean levels in use and dashed lines imply levels which are turned off.] 



The schemes which achieve optimal sum capacity follow two rules. First, if a transmitter does not have 
enough information about other links, it acts greedily and sends at its maximum possible link rate. This 
is the case for the T2, which does not know about Ti Di link. Second, if the link does know who it 
is causing interference to and by how much, then it ensures that it only sends at rates and powers, which 
do not impede on the success of other flows. This is the case for transmitter Ti, which knows that it 
is causing interference at receiver D2. In short, the transmitters act greedily and politely by maximizing 
their individual rate but constraining themselves not to hurt other flows about which they have sufficient 
information. 
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We next show that the rate allocation in the above strategy is the only unique possible way to get a 
universally optimal strategy with 1.5 rounds of message passing. 

Theorem 3. For the network connectivity of Z-channel, there exists a unique distributed rate allocation 
strategy that is universally optimal with 1.5 rounds of message passing protocol and is given by the 

strategy of Theorem 2. 

Proof: To show the uniqueness, we first consider the strategy of the second user. Since the second 
user does not know nn I2 has to transmit at ^22 to avoid being sub-optimal for the case of rin = 0. This 
implies that irrespective of the channel gains ^22 and ni2 no strategy that involves the rate of the second 
user other than 7222 can be universally optimal. 

Now given that the above T2 strategy of sending at full rate, the first user cannot assign any other rate 
than in the proof of Theorem 3, since only this rate would result in the optimality of the sum rate given 
that the second user is sending at 7122- ■ 

Perhaps the most interesting aspect of Theorem 2 is that locally optimal rates achieve globally optimal 
sum capacity. The reason is that the proposed distributed scheme achieves a comer point on the capacity 
region of Z-channel, as shown in Figure 2(c), which explains the uniqueness as shown in Theorem 3. 
Thus an interesting observation is that corner points need less information about the network at some of 
the nodes. We will observe this fact again for a bigger network in Section V. 

B. Gaussian Z-Channel 

In a Gaussian Z-channel, K — 2, and /i2i = 0. The received signal Yji, j = 1, 2, of a Z-channel is 
given by 

Yii = huXu + Zu (8a) 

Y2i = huXu + h22X2i + Z2i. (8b) 

Also, let SNRj = i e {1, 2} and INR2 = |/ii2p- We have assumed that all hifs are real and positive 
and hence use ha = y^SNR^, hi2 = \/INR^. 

After two full rounds, every node in the network knows the complete network state, i.e. the matrix H 
is known completely to all four nodes. In this case, an upper bound on the capacity region is given in 
the following Theorem. 

Theorem 4 ([4, 16-18]). The channel capacity region for a two-user Gaussian Z-channel is upper bounded 
by the region formed by the set of nonnegative rates R2) satisfying 

i?i < log(l + SNRi) (9a) 

i?2 < log(l + SNR2) (9b) 

7/INR2 < SNRi, 

Ri + R2< log (1 + SNRi) + log (^1 + jf^) • (10) 

//INR2 > SNRi, 

i?i + i?2 <log(l + SNR2 + INR2). (11) 

We now focus on 1.5 rounds of message passing in Gaussian channel. In Section IV- A, we showed 

that for deterministic Z channel, the sum capacity can be achieved with 1.5 rounds of message passing. 
We now show that sum capacity within 2 bits can be achieved for a two-user Gaussian Z-channel with 
1.5 rounds of message passing. Thus, there exist an approximate universally optimal strategy with r = 2. 
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Theorem 5. The sum capacity for a Gaussian two user Z-channel can be achieved within two bits with 
the local information N.-, and N[ at the nodes obtained with 1.5 rounds of message passing and the side 
information about network size, i.e, SI = {K = 2}. 

Proof: With 1.5 rounds of message passing, the two nodes know the network connectivity, i.e, if they 
are the first user (that causes interference to the other) or the second. 
The second transmitter uses a codebook of rate 

i?2 (SNR2, INR2) = log (1 + SNR2 (1 + INR2) / (1 + 2INR2)) , (12) 

with a power level of P2 = 1 to transmit. The first user however uses a common and a private message 
with rates 

i?i,c (SNRi,SNR2,INR2) 

fO iflNR2<SNR2 



= < 



log (1 + min fsNRi, 1^k%k, 11 if INR2 > SNR2, INR2 > SNRi (13) 

\ \ ^+ 1+SIMR2+21NR2 / / 

log (1 + l+2INR.+'s'Nt(l+INR.) ) if INR2 > SNR2, INR2 < SNRi 



(14) 



(SNRi,SNR2,INR2) 

if INR2 > SNR2,INR2 > SNRi 

log (1 + SNRi/ (1 + INR2)) otherwise 

Further, the power levels of 

'0 iflNR2<SNR2 

Pi,c(SNRi,SNR2,INR2) = <( 1 if INR2 > SNR2, INR2 > SNRi (15) 

INR2/(1 + INR2) if INR2 > SNR2, INR2 < SNRi 

P rQMR QMR IMR^ if INR2 > SNR2, INR2 > SNRi 

Pi„(SNRi,SNR2,INR2) = |^^^^^l^^^^ otherwise ^^'^ 

are used for the common and private parts, respectively. The common and the private parts are added 
together and transmitted. 

In Appendix B, we prove that the above rates can be decoded and yield the sum capacity within 2 bits. 

■ 

Corollary 2. There exist an approximately universally optimal strategy with t — 2 with the local 
information and N- provided by 1.5 rounds of message passing and the side information that K — 2. 

Proof: With 1.5 rounds of message passing and side information K — 2, all nodes know the global 
connectivity. For all the connectivity choices, a strategy can be found by the nodes based on the channel 
gains they know that would achieve the global information sum capacity within 2 bits. For example, if 
the network is fully-connected, then the nodes can use the node identities to order themselves and can use 
an appropriate (approximately) optimal strategy. If the network is a Z-channel, the strategy of Z-channel 
described in Theorem 5 can be used. ■ 
For a general Gaussian Z-channel, the capacity region is not known exactly. There exist achievable 
schemes that can achieve the region as shown in Figure 3. The achievable point corresponding to maximum 
R2 when Pi = log (1 -I- hli) is known exactly [18] but the maximal rate point corresponding to P2 = 
log(l -I- hl2) is only known approximately. Here, we achieve a point approximate to the maximal rate 
point corresponding to P2 = log(l + hl2)- 
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i?2 

log(l + hl^) 



Ir ner Bound 




achievable sum, 1.5 rounds 



Outer Bound 



Fig. 3. Outer and Inner Bounds in Gaussian Z-channel. 



While there is no loss in rate due to lack of knowledge in the deterministic Z-channel, it is no longer 
true for the case of Gaussian Z-channel. For example, exact capacity region is known for some regimes, 
like the strong interference case. However, the optimal sum rate point is not achieved by the distributed 
scheme with 1.5 rounds of knowledge. In this case, lack of full knowledge at J 2 implies it is not aware 
that it is part of a strong interference channel. As a result, the T2 backs off a little on the rate thinking 
that it may not be able to cancel all interference. 



V. Three User double Z-channel 

In this section, we will describe our results for three user double Z-channel. We will first find the 
capacity region for a general class of deterministic channels in which the interference at the receivers 
is a deterministic function of the inputs and the received signal is a deterministic function of the direct 
signal and the interference, and then specialize it to the deterministic model of Section II. We then derive 
new genie-aided outer bounds for the Gaussian double Z-channel. We will further provide an achievable 
strategy with 1.5 rounds of message passing algorithm which achieves the sum capacity in some cases 
and has an unbounded gap in certain other cases. We also show that the loss is unavoidable in the sense 
that there does not exist any strategy with local information that can be universally optimal even with 
additional side information of network connectivity, SI = E3Z (Esz is the network connectivity representing 
three- user double Z-channel), which is more than the side information SI = {K = 3}. The achievability 
is extended to the Gaussian model. Further, we show that 1.5 round strategy can be tweaked for the first 
user to derive a 2.5 strategy which is optimal for the deterministic case and is approximately optimal in 
terms of the sum capacity for the Gaussian case with side information SI = {K = 3}. This also proves 
that there exist a universally optimal/approximately universally optimal strategy with the local information 
obtained by 2.5 rounds of message passing and the side information of {K = 3}. That is because with 
2.5 rounds of messages, in all network connectivities except double-Z channel, all the nodes would know 
the whole state in 2.5 rounds and can hence take the same decision as the centralized optimal solution. 

A. Channel Models and Messaging Passing 

In a deterministic double Z-channel, K = 3 and ^13 = 77-21 = '^si = '^32 = 0. Specifically, the received 
signal Y^j, j = 1, 2, 3, of a double Z-channel is given by 

Y^i = S^-""Xi, (17a) 
Y2i = S™Xu®S™X2i (17b) 
F3i = S'^-^^^'Xs. © S^-"^3X3, (17c) 

In a Gaussian double Z-channel, and his = ^21 = ^31 = ^32 = 0. The received signal Yji, j = 1, 2, 3, 




of a double Z-channel is given by 



Yii = hiiXii + Zii 



(18a) 
(18b) 
(18c) 



Also, let SNRj = i G {1,2,3} and INRj+i = i E {1,2}. Further, we assume that all hi/s 

are real and positive. 

If the side information is SI = {K = 3}, message passing converges in 4 rounds. The details of the 
protocol are as follows (To simplify the notation, the node identities appended to each channel gain are 
not shown and implied by the channel subscripts.). 

1) Round 1: mk,i = ipk- Mi,i = {H^i}, M2,i = {1/12,^^22} and Ms,! = {1/23,^^33}. 

2) Round 2: mi,2 = {Hu, H12, H22}, ^2,2 = {H12, H22, H23, Hs^} and 7723,2 = (j). Mi,2 = 0, M2,2 = 
{Hn, H23, Hss} and Ms^2 = {^^12,^^22}. 

3) Round 3: mi,3 = {H23, H33}, 77^2,3 = {^^ii} and 7/13,3 = 0- ^1,3 = ^^2,3 = and M3,3 = {Hn}. 

4) Round 4: No new information is to be sent by any transmitter and hence the algorithm halts by 
transmitters sending a silent message 0. 

Note that Hij is replaced with riij in the deterministic or hfj for the Gaussian model. 

The local view of each node after 1.5 and 2.5 rounds of message passing is shown in Figure 4. It 
is clear that with fewer rounds of message passing, the local view of each node is less than full. The 
challenge for the node is that they have to make decisions on their transmission parameters based only 
on their local view. Thus, we will ask how close we can get to sum-capacity with 1.5 and 2.5 rounds 
of message passing. Before we derive sum-capacity with partial information, we will need full capacity 
region for the case of full information. In the next section, we derive the new capacity results. 



B. Capacity Regions With Full Information 

The capacity region for a general three-user interference channel is open. It has been solved in certain 
special cases of deterministic model in [11]. In this section, we provide the capacity region for a three 
user double-Z channel, which has not been considered before. 

Theorem 6 (Double-Z Deterministic Channel Capacity Region). The deterministic channel capacity region 
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for a three user double Z interference channel is the set of nonnegative rates {Ri, R2, R3) satisfying 

Ri < nu, i = 1,2,3 (19a) 

R1 + R2 < niax(nii,ni2,n22,nii + - ni2) (19b) 

R2 + R3 < max(n22 , ri23 , "-33 , ?^22 + riss - ^23 ) (1 9c) 
Ri + R2 + R3 < max(n33, 7123) + (riu - riu)^ 

+ max(ni2,n22 - ^23)- (19d) 

Proof: The proof is provided in Appendix C for a class of deterministic channels which is much 
broader than the deterministic model, along the lines of the class of deterministic channels for two user 
interference channels proposed in [12]. This region can be specialized for the deterministic model in (17a)- 
(17c) to get the above result. ■ 

Note that the bounds on Ri are the single user bounds, and the Ri + Rj bounds are due to the two two- 
user Z-channels, one consisting of {Ti,T2, Di, D2} and the other consisting of {T2,T3, D2, D3}. Finally, 
the sum bound Ri + R2 + R3 is due to common transmitter-receiver pair {T2, D2} in the two Z-channels. 

For the Gaussian channel, we provide an outer bound to the capacity region for the three user Z-channel. 
We divide the region of the channels to four cases depicting the strong/weak interference from the first 
two transmitters. 

1) INR2 > SNRi and INR3 > SNR2: In this case, an outer bound on the rate region is given as follows. 



Ri < log(l + SNRi) (20a) 

R2 < log(l + SNR2) (20b) 

i?3 < log(l + SNR3) (20c) 

R1 + R2 < log(l + SNR2 + INR2) (20d) 

R2 + R3 < log(l + SNR3 + INR3) (20e) 

2) INR2 > SNRi and INR3 < SNR2: In this case, an outer bound on the rate region is given as follows. 

Ri < log(l + SNRi) (21a) 

R2 < log(l + SNR2) (21b) 

R3 < log(l + SNR3) (21c) 

R1 + R2 < log(H- SNR2 + INR2) (21d) 

SNR-^ 

R2 + R3 < log(l + SNR2) + log(l+^-p^) (21e) 
Further, if (INR2 + 1)INR3 < SNR2 

R, + R2 + R3 < log(l + , ) + log(l + INR2 + SNR2) (22) 

i -|- MMK3 

else if (INR2 + 1)INR3 > SNR2 

R1 + R2 + R3 < log(l + INR3 + SNR3)+log(l + INR2). (23) 

3) INR2 < SNRi and INR3 > SNR2: In this case, an outer bound on the rate region is given as follows. 
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Ri < log(l + SNRi) (24a) 
R2 < log(l + SNR2) (24b) 
R3 < log(l + SNR3) (24c) 

R1 + R2 < log(l + SNRi)+logh + (24d) 

R2 + R3 < log(l + SNR3 + INR3) (24e) 

4) INR2 < SNRi and INR3 < SNR2: In this case, an outer bound on the rate region is given as follows. 

(25a) 
(25b) 
(25c) 



Ri 


< 


log(l + SNRi) 




R2 


< 


log(l + SNR2) 




R3 


< 


log(l + SNR3) 




Ri + R2 


< 


log(l + SNRi) 


+ log (^1 + 


R2 + R3 


< 


log(l + SNR2) 


+ log ( 1 + 



SNR 



2 



I + INR2 

SNR3 
I + INR3 



(25d) 
(25e) 



Further, if (INR2 + 1)INR3 < SNR2 



R, + R, + R3 < Ml + SNRO + log (1 + ^^)+ log (1 + ^^) (26) 

else if (INR2 + 1)INR3 > SNR2 

R1 + R2 + R3 < log(l + SNRi) + log(l + INR3 + SNR3). (27) 

Theorem 7 (Double-Z Gaussian Channel Outer Bound). The capacity region of Gaussian double Z-channel 
is outer bounded by the region formed by R2, R3) satisfying (20)-(27). 

Proof: All the single user bounds and the bounds on Ri + R2, R2 + R3 follow from the two user 
Z-channel. The new bounds for Ri + R2 + R3 in the second and the fourth cases are new, and are shown 
in Appendix D. ■ 
We note that in the case of weak interference for the two Z-channels, (Ti, T2, Ri, R2) and (T2, T3, R2, R3), 
it is not always optimal to treat interference as noise unlike the case in two-user Z-channel. However, 
there is a region in which the interference is very weak which is when INR3(INR2 + 1) < SNR2 where 
treating interference as noise is optimal. 

Lemma 1. Let K > 3 and consider a symmetric K user Z-channel, where (K-1) Z-channels are stacked 
one over the other, with SNR^ = SNR and INRj = INR. V<Ihen INR(INR + 1) < SNR, the sum rate is 
outer bounded by log(l + SNR) + [K — 1) log (l + Y^nt^)- ^^"■^ ^^'■^ regime, the sum-capacity can be 
achieved by treating interference as noise. However as we saw in the case of 3-user system, the above is 
not true in general for INR < SNR < INR(INR + 1)). 

Proof: This proof follows using the same techniques as in the special case of very weak interference 
in Appendix D and is thus omitted. ■ 



C. Deterministic Model: 1.5 Rounds 

In this section, we will study the achievable sum-rate after 1.5 rounds of message passing. Recall that 
the local view of each node is given in Figure 4(a) and the nodes have to base their choice of rates 
and transmission strategy only on their local views. We first describe an achievable rate, building on the 
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Z-channel allocation strategy discussed in Theorem 2. Note that to derive the achievable rate, we will 

only assume side information about network size, i.e K = 3. However, we will show the converse for 1.5 
rounds with extra side information about connectivity SI = {Esz}, which makes our converse stronger 
than one with only network size as side information. 

1) From the point of view of the first transmitter, it knows the upper part of the Z-channel (nn, ni2, 
^22) and only extra cross link possible is a link from the second transmitter to the third receiver. 
Thus, the first transmitter knows that it is a Z-channel with the third user alone or a double-Z 
channel. Since Ti does not know which of the two cases are applicable, it assumes the worst case 
that second transmitter T2 will send at full rate and acts as if it is a Z-channel consisting of the first 
two users only. Note that this strategy is optimal if network connectivity turned out a Z-channel 
with a decoupled transmitter T3, instead of double-Z channel. Further this rate can be decoded at 
the receiver even if it is a double Z-channel. 

2) From the point of the second transmitter, it knows riu, ^22> ^^23 and 7133. Hence, the only cross link 
possible that it does not know is risi. Note that while 7131 is actually zero, the second transmitter 
does not know about it. Now suppose that the second transmitter sends as if there are only two 
users (the second and the third) and hence functions as the upper user of the Z-channel. It assumes 
that if it is a double-Z, the first transmitter Ti will send as if the second transmitter T2 is sending at 
full rate and thus will take care of interference itself. Hence the data can be decoded at the second 
receiver D2. If the network connectivity is cyclic with the presence of risi, then every user will use 
this strategy and back off and thus the data can still be decoded. 

3) The third transmitter T3 knows 7233 and 7223. Thus the cross channel gains that the third transmitter 
does not know about are 7712 and 7721. Even though 7721 = 0, the third transmitter does not know 
about it. The third transmitter transmits at 7733. In case both the links 7112 and 7121 are present, the 
second transmitter knows the whole state and can shut down or select the rate since it would know 
the strategy of other users. In case only n2i is present, then the second transmitter knows it is one- 
to-many configuration and there exist an optimal strategy in this case where the first and the third 
user send at full rate while the second transmitter backs off. The only other case is the double-Z 
channel where we use this strategy and show that it is still achievable. 

To summarize, this achievability scheme reduces for a double Z-channel to using the same strategy 
at all the users as in the two user Z channel with the relevant knowledge. More specifically, the first 
transmitter assumes that it is the top transmitter in a two user Z channel (consisting of the first and the 
second user), the second transmitter assumes that it is the top user of a two user Z channel (consisting of 
the second and the third user) and the third transmitter assumes that it is the bottom user of a two user 
Z channel (consisting of the second and the third user). 

Theorem 8 (Achievable Rate with 1.5 Rounds). The above scheme can achieve the following sum-rate 
with 1.5 rounds of message passing for a double-Z channel: 

min(max(7722, ^123, ?T'33, ^^22 + ?T'33 - n2z),n22 + ^133) 

-Fmin(77ii,max(77ii,77i2) - min(77i2, 7722))- (28) 

Proof: We show that each transmitter uses network state information obtained from the first round to 

decide the transmission strategy. The third transmitter sends at full rate, 7^33. The second transmitter does 
not know rin and uses the strategy as if it was a Z-channel consisting of only the second and the third 
user. If 7723 < ^337 the second transmitter will send at a rate of (7722 — ^23)^- If ^^23 > ^33, the second 
transmitter sends at a rate of min(max(7722, ?T'23) — ?T'33, ^122)- Thus, the second transmitter sends at a rate of 
min (7722, max (7722,^23) - min(7723, ^33)) = min(max(7722, ^23, ^33, 7722 + ^33 - 7123), ?T'22 + ?T'33) -^33- The 
first transmitter transmits as if it was a Z-channel consisting of first two users and considering that the 
second user sends at a rate of 7722 and hence sends at a rate of min(77ii,max(77ii,77i2) — min(77i2, 7722))- 
Hence, the above sum rate can be achieved. ■ 
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Fig. 5. Loss in spatial reuse due to distributed decisions. 



So, the obvious next question is how well does the above scheme perform compared to the sum-capacity 
in Theorem 6. We show that the gap from the sum-capacity can be anywhere from zero to arbitrarily large. 
First, we classify all those network states in which the above distributed scheme achieves sum-capacity. 

Theorem 9 (Achieving sum-capacity). The sum capacity can be achieved with 1.5 rounds of message 
passing for a double-Z channel if any of the following are true 

• ^23 > ^22 + 

• ni2 > rill + n22 

• ^23 < "-33 and n22 > n23 + ni2 

Proof: 

• n23 > n22 + ^^33: In this case, (28) reduces to the sum rate of 7133 -|- min(nii -|- n22, max(nii, 7122, 
^125^11 + ^^22 — ^^12)) which is optimal since the upper bounds on and Ri + R2 in Theorem 6 
are identical to the above expression. 

• ^12 > nil + ^22: In this case, (28) reduces to the sum rate of riu + min(max(n22, 'n.23, ^1-33, ^1-22 + 
^33 — ^23), ^^22 + ^33) which is optimal since it is identical to upper bounds on Ri and R2 + -R3 in 
Theorem 6. 

• ^^12 < ^22 and ^23 < ^33 and 7222 > ^23 + ^^12: In this case, the achievable sum rate and the outer 
bound both reduce to 7222 + ^33 — ^23 + (^11 — '^12)^- Hence, the sum rate is optimal since this 
matches the Ri + R2 + R3 outer bound in Theorem 6. 

■ 

We next construct an example where the loss can be arbitrarily large. Let 77ii = 7712 = 7722 = ^23 = 
7733 = X. The distributed scheme achieves a sum-rate of x related to rate-tuple of (0,0, x) as shown in 
Figure 5(a). However, the rate tuple (x, 0, x) is in the capacity region and relates to the rate allocation 
shown in Figure 5(b). By taking x large enough, the achievable sum rate can be made arbitrarily far from 
the outer bound. Thus the above distributed strategy is not universally optimal, i.e, it does not achieve 
sum-capacity in all network states. The next result shows that no distributed scheme can be universally 
optimal with 1.5 rounds of information, even with extra side information. 

Theorem 10 (Loss is Inevitable). There exist no universally optimal strategy with the local information 
obtained by 1.5 rounds of message passing for double-Z channel even with the side information SI = E^z, 
the network connectivity of double-Z channel. Thus, there is no universally optimal strategy with the local 
information obtained by 1.5 rounds of message passing and the side information of K = 3. 
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Proof: We first assume that the each node is given the information that the network connectivity is 

a double-Z channel and the information about its relative placement in the network. So, the nodes can 
choose strategy tailored for double-Z channel. Thus, the set of network states consistent with the local 
information only have nn, 7122, n^s, and 7123 as parameters. 

We will now prove the Theorem by contradiction. Suppose that there is a universally optimal strategy. 
Since the third transmitter does not know n22 and riu and if riu = n22 = 0, the only rate allocation that 
can be used by the third transmitter that would be optimal is = 7233. Hence, if there exists a universally 
optimal strategy, = 7733. (Even though the transmitter knows 7123, it cannot use this extra knowledge 
to make a decision on the rate allocation.) 

The second transmitter does not know nu. Hence, if there exist an universally optimal strategy, it 
should work even if nu = 0. For nu = and -R3 = ^33, the only way the second user can use optimal 
rate allocation is to transmit at a rate as in the proof of Theorem 8. 

The first transmitter does not know 7133. Hence, its optimal strategy should work even if n^s = 0. If 
7133 — 0, R2 — 7i22 and hence the first user will have to transmit at a rate as in Theorem 8. 

Thus, we see that if there exist an universally optimal strategy, the rates of the users have to be the 
same as in Theorem 8. We note that the sum rate in Theorem 8 is not optimal in general and hence leads 
to a contradiction. Thus, there is no universally optimal strategy. ■ 

Theorem 10 is the key result in the paper. It shows that no distributed scheme, which only relies on its 
local view after 1.5 rounds, can be guaranteed to be globally optimal for all channel conditions even with 
the global knowledge of connectivity. Distributed schemes can be optimal for some values of network 
matrices H e G{^3z) but not all of them simultaneously. Thus, there is no universally optimal scheme 
with the local information provided by 1.5 rounds of knowledge and the side information of network 
connectivity. 

The fundamental reason for this unavoidable loss for some channel gains is severely incomplete view of 
the network at different transmitters. As shown in Figure 5, the loss in spatial reuse is due to mismatched 
knowledge in the deterministic double-Z channel. Here Ti is backing off for T2, which in turn is backing 
off for T3. In this example, Ti could have sent at full rate but ended up being too conservative due to 
its lack of knowledge about state of T3, as shown in Figure 5(b). As a result, Ti tailors its action to 
the worst case scenario, which is T2 sending at full rate. The reader is reminded that each node is only 
allowed to adapt its transmission based on its own local knowledge and cannot base its decision on what 
is not known. Thus, once a node defines a rate allocation policy based on its local information (e.g 1.5 
rounds), it has to use the same allocation for all the possible values of other channel gains that are not 
known with the local and side information. 

We observe that the above proof can also be extended to show that there does not exist any strategy 
that will perform within a bounded gap (independent of channel gains) from the optimal sum capacity. 
Thus, an approximately universally optimal strategy also does not exist. To prove that claim, assume 
nij = CijL where L can be taken as large as possible, Cij > 0. With the local channel knowledge at the 
nodes, Rs > CijL — Q(l) (where 6(1) represents a function that is independent of L) since that is the 
only way an approximately optimal strategy can exist when C22 = Cn = 0. Given this strategy of the third 
user, R2 > L min(c22, max(c22 — C33, C23 — C33, C22 — C23)"'") — 0(1) since otherwise the strategy will not be 
approximately universally optimal for cu = 0. Similar strategy goes for the first user. We can also show 
that this strategy will be unbounded rate away from optimal as L ^ 00, thus proving non-existence of 
approximately universally optimal strategies. 

D. Gaussian Channel: 1.5 Rounds 

The achievability strategy for Gaussian channels follows the same technique as in the deterministic 
model. As mentioned for the deterministic model, the achievable scheme used by each of the users will 
work as an achievable strategy for all possible global network connectivities if they see the same local 
network connectivity. The details are as follows. 
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The third transmitter makes a codebook of rate 

i?3(SNR3, INR3) = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) , 



(29) 



and uses a power level of P3 = 1 to transmit. The second user however uses a common and a private 
message with rates 



i?2,c(SNR2,SNR3,INR3) = 


log (^1 + 

log (1 + ]^_,_2|NR2 H-2INR3+SNR3(1+INR3) 



I+INR2 INR§ 



INR3 
SNR3INR3 
+SNR3+2INR3 



if INR3 < SNR3 

if INR3 > SNR3, INR3 > SNR2 (30) 
if INR3 > SNR3,INR3 < SNR2 



i?2,p(SNR2,SNR3,INR3) 
'0 

.log (l + rSSNR2/(l + INR3)) 
Further, the power levels of 



if INR3 > SNR3,INR3 > SNR2 
otherwise 



P2,c(SNR2,SNR3,INR3) 



INR3/(1 + INR3) if INR3 > SNR3, INR3 < SNR2 



(31) 



if INR3 < SNR3 

if INR3 > SNR3, INR3 > SNR2 (32) 



P2,p(SNR2,SNR3,INR3) 



if INR3 > SNR3,INR3 > SNR2 

1/(1 + INR3) otherwise 



(33) 



are used for the common and private parts. The common and the private parts are added together and 
transmitted. 

The first user uses a common and a private message with rates 



Pi,e(SNRi,SNR2,INR2) 
'0 

log ( 1 + min(SNRi, 



^log( 



1 + 



INR 



INR2 

. SNR2INR2 
1+SNR2+2INR2 



1+2INR2+SNR2(1+INR2) 



if INR2 < SNR2 

if INR2 > SNR2,INR2 > SNRi 
if INR2 > SNR2,INR2 < SNRi 



Pi,p(SNRi,SNR2,INR2) 


log (1 + SNRi/(l + INR2)) otherwise 
Further, the power levels of 



if INR2 > SNR2,INR2 > SNRi 



Pi,e(SNRi,SNR2,INR2) 



if INR2 < SNR2 
if INR2 > SNR2,INR2 > SNRi 
INR2/(1 + INR2) if INR2 > SNR2, INR2 < SNRi 



(34) 



(35) 



(36) 
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are used for the common and private parts. The common and the private parts are added together and 
transmitted. 

We will show in Appendix E that the above rates can be decoded by the receivers. We now see the 
various cases when the achievable strategy will be bounded distance from the sum-capacity. 

Theorem 11. The sum-capacity within 4 bits can be achieved with the first full round and half of the 
second round of message passing when any of the following conditions hold 

1) INR3 > SNR3, ^^^.K, > SNR2 

' i SNR-i I 2INR3 

2) INR2 > SNR2, ^ , ^:^^.R, ' > SNRi 

' 1+SNR2+21NR2 

3) INR2 < SNR2 and INR3 < SNR3, SNR2 > INR3(INR2 + 1) 
Proof: 

1) INR3 > SNR3, ^ jK'R^K, > SNR2: In this case, R2 = log (l + ^i;^SNR2) which yields 

1+SNR3+21NR3 ^ / 

-Ri + R2 within 2 bits of optimal as in the two-user Z-channel and since is within 1 bit of 

log(l + SNR3), Ri + R2 + R3 is within 3 bits of the outer bound. 

2) INR2 > SNR2, snr1inr2 - > SNRi: In this case, Ri = log(l + SNRi). Further, R2 + R3 is within 

1+SNR2+21NR2 

2 bits of that in the case of two-user Z-channel which is further within 2 bits of optimal; thus 
leading to the sum rate within 4 bits of optimal. 

3) INR2 < SNR2 and INR3 < SNR3, SNR2 > INR3(INR2 + 1): The achievable sum rate in this case 
reduces to 

, / SNRi \ , / I + INR2 SNR2 \ , / I + INR3 

log 1 + — + log 1 + , \,„ ' - — + log 1 + SNR3: ' 



l + INR2y °V l + 2INR2l + INR3y °V I + 2INR3, 

If INR2 > SNRi the outer bound is log (1 + SNR2 + INR2) +log (l + jfj]^). However, if INR2 < 

SNRi, the outer bound is log(l + SNRi) + log (l + ^j^^ + log (l + xfyl^). We note that in 
both these cases, the achievability is within 4 bits in both the cases. 

■ 

As in deterministic model, there are cases when the sum rate achieved with 1.5 rounds can be arbitrarily 
far from the optimal. As an example, consider INR2 = SNRi = SNR2 = INR3 = SNR3 = x. The 
achievable sum rate is log[(l -|-6a; + llx^ -f-Sa;^) /{l + 3x + x'^)]. However with full information, rate pair of 
(log(l-Fx),0,log(l-Fa;)) can be achieved. For x > 2, log[(l-F6x-Mla;2-F5a;^)/(l-F3a;-Fa;^)] < 21og(l-Fx) 
and the difference grows unbounded with x thus proving that the difference between achievability and 
outer bound can be unbounded in some cases. 

We also note that the strong converse mentioned for the deterministic case also holds in the Gaussian 
case that there is no approximately universally optimal strategy with 1.5 rounds of message passing. Since 
the proof uses the same ideas, it is omitted. 



E. Both Channels with 2.5 Rounds 

We first note that with 2.5 rounds of message passing, all the nodes know the state information except 
the third transmitter which does not know Hu. Thus, the side information of SI = {K = 3} with 2.5 
rounds of message passing and the side information of network connectivity, SI = {E}, with 2.5 rounds 
of message passing are equivalent. 

We first consider the capacity region of the deterministic double-Z channel as shown in Figure 6. 
The region consists of single user constraints, the constraints on Ri + R2, R2 + Rs and a constraint on 
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Ri + R2 + Ri depicting the various planes in the figure. Note that there is a segment on the optimal face 
of i?i + i?2 + R3 that has Rs = 77,33 marked in the figure. Any point on this segment can be achieved 
with 2.5 rounds of message passing. This is because the third transmitter sends at a rate of 7233 while the 
first two transmitters know all the channel gains to select the policy to operate at any point on this line. 

Thus, there exist universally optimal strategy with the local information at the nodes obtained by 2.5 
rounds of message passing and the side information about network size SI = {K = 3}. So with 2.5 
rounds, we can prove existence of a universally optimal strategy with less side information compared to 
the converse for 1.5 rounds in Theorem 10. 

Theorem 12. There exists a universally optimal strategy with the local information at the nodes obtained 
by 2.5 rounds of message passing when each node is provided the side information that there are only 
three nodes in the network. 

Proof: With 2.5 rounds of message passing, all the nodes would know the network connectivity in 
their connected component. In all the cases except the double-Z channel, aU the nodes would know all the 
channel gains of the connected component in which they are and hence can use an optimal strategy by 
ordering the nodes based on the node identities and the network states. For double-Z channel connectivity, 
the strategy described before can be used to get a universally optimal strategy. ■ 




Fig. 6. Capacity region of Deterministic double Z-channel. 



We now consider Gaussian channel model. As we saw in the deterministic model that any strategy 
following the rate allocation on a line is optimal, we consider one of the corner point on the line 
corresponding to higher R2 and provide an achievability strategy that is within 4 bits of sum capacity 
for 2.5 rounds of message passing. In this strategy, the second and the third user uses the same policy 
as with 1.5 rounds and therefore do not change the strategy. However, the first transmitter knows all the 
channels and hence changes its strategy. 
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The first user uses a common and a private message with rates 

'min(log(l + SNRi), log(l + INR2 + SNR2) - R2) 



SNR2(1+INR2)+INR2> 



min(log(l + SNRi), log(l + INR2) - R2 
min(log(l + T^^),log(l + 
min(log(l+ 4«,J ,log(l 
min(log(l + SNRi), log(l + INR2 + ,^,^^3 
-in(log(l + ,J»^),log(i + ^' 



1+21NR2 

INR| ' 

1+2INR2 ' 
SNR2 



-R2) 



-R2) 
-R2) 
-R2) 



if §>>> 

if >>>< 

if !><> 

if >>« 

if 

> 



l+SNRi+INR2^' ""^^V-" ' I+2INR2' 

min(log(l + SNRi), log(l + INR2 + ^-^^) - R2,c, 
log(l + INR2 + x^) - R2,p, log(l + INR2 + SNR2) - R2) 
mmnngn I '^^^ ^ IncrM I INRi+SNR2lNR3(l+INR2)/(l+INR3) x p 

min(log(i + i+2iNR2'''^°gv^ 1+21NR0 ) ~ -"2,c, 



if >« 



~< (38) 
if <»< 



2" I+2INR2 

log(l + 'NRi+SNR2(l+INR2)/(lH-INR3) ) _ ^^^^^ ^^g^, ^ 



INR|+SNR2(1+INR2) 



1+21 NR2 

min(log(l + 1 , 2iNR„ ),log(l + 



INR^+SNR2(l+INR2] 



I+2INR2 



) - R2) if <>« 



)-R2) 



if «<| 



I+2INR2/' 6V ' I+2INR2 

where the four inequalities in the if condition represents the order in INR2 vs. SNR2, INR3 vs. SNR3, 

> 
< 



INR2 vs. SNRi and INR3 vs. SNR2 respectively. ^ represents that it can be either side of inequality. For 



example, <><> represents INR2 < SNR2, INR3 > SNR3, INR2 < SNRi and INR3 > SNR2. 

if INR2 > SNRi 



Ri,p - 



Further, the power levels of 



log(l + SNRi/(l + INR2)) iflNR2<SNRi 

1 iflNR2>SNRi 
INR2/(1 + INR2) iflNR2<SNRi 



(39) 



(40) 





1/(1 + INR2) 



if INR2 > SNRi 
if INR2 < SNRi 



(41) 



are used for the common and private parts. The common and the private parts are added together and 
transmitted. It is straightforward to see in all the cases that the above rates can be decoded by all the 
users. Further as shown in the next theorem, the above strategy achieves the sum capacity within four 
bits. 

Theorem 13. The strategy achieves a sum-rate that is within four bits of the full-information sum-capacity 
for all H e GiEsz). 

Proof: We will show in Appendix F that the sum-capacity within four bits can be achieved by splitting 
the channel gain regimes into 14 cases. ■ 

Corollary 3. There exists an approximately universally optimal strategy with t — A with the local 
information at the nodes obtained by 2.5 rounds of message passing when each node is provided the 
side information that there are only three nodes in the network, i.e. SI = {K = 3}. 

Proof: With 2.5 rounds of message passing, all the nodes would know the network connectivity in 
their connected component. In all the cases except the double-Z channel, all the nodes would know all the 
channel gains of the connected component in which they are and hence can use an optimal strategy by 
ordering the nodes based on the node identities and the network states. For double-Z channel connectivity, 
the strategy described in this section can be used to get approximately universally optimal strategy. ■ 
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VI. X-USER Z-NETWORK 

While exact analysis for two and three-user interference channels was tractable, extensions to general 
interference channel remains out of reach at the current moment. To make progress, we will consider 
a special case of channel gains which reduces the network parametrization to only three unknowns: the 
number of nodes in the network, and two channel gain parameters representing the direct link and the 
cross Unk. In addition, we will assume that the network connectivity is of the form of K — 1 Z-channels 
stacked on top of each other as described below. Our objective is to quantify the achievable sum-rates 
with limited number of rounds of message passing. 

A. Channel Models and Message Passing Protocol 

For the deterministic model characterized by the direct channel gain n and the cross channel gain m, 
the received signal l^j, j = 1, 2, • • • ,K of X-user Z-channel is given by 

= S^-"Xi, (42a) 
y^.. = S^-^^Xfj^i), © S^^"Xji for all 2<i<K. (42b) 

We label the top transmitter as Node 1, the next transmitter as Node 2 and so on. Thus, we would 
use the phrase "node above" and "node below" with respect to this labeling unless otherwise stated. We 
assume that all users knows that it is a symmetric Z-channel but do not know n, m and their relative 
position in the network (which also includes the information about K). In this case, the message passing 
protocol can be performed with less message content than sending the whole channel matrix as explained 
in Section III-C. We will now provide achievability with d rounds of message passing algorithm for 
d < K. Since the users do not know K and their placement in the network, the decisions have to be 
made in a distributed fashion. 

For the Gaussian model characterized by the direct channel gain VSNR and the cross channel gain 
ViNR, the received signal at receiver j at time i, Yj^, j = 1, 2, • • • ,K of K-user Z-channel is given by 

Yu = VsmXu + Zu (43a) 
Yji = V\mX^j_i)i + VsmXji + Zji for all 2<j<K. (43b) 

We also note that in this section, we will also sometimes consider d rounds of message passing protocol 
for integer d rather than d.5 rounds. This is because the additional symmetry in the network (all direct links 
are identical and all cross links are identical) allows the receiver to decide on their decoding schemes even if 
they have less knowledge of links than the transmitter. We assume that the nodes have the network connec- 
tivity and its reduced parametrization as side information, i.e, SI = {G', Ha — Hn, Ha+i — H12}, where 
F = {E : (T,,D,)} and G' = {H : H^j = if if j ^ t or j ^ t + 1, Hn = H22 = ' ■ ■ E {0,1, . . . ,q}, 
H12 = H23 = • • • e {0, 1, . . . , q}}. However, they do not know the network size K or the two channel 
gains {Hu,H2i}. 

B. Deterministic Z-Network 

We first consider the deterministic model and let a — m/n. We show that for a > 2 or a < 1/2, one 
round of message passing achieves the sum capacity in the following theorem. 

Theorem 14. For a >2 and a < 1/2, there exists a strategy which achieves optimal sum-rate with one 
round of message passing protocol for any K if each node knows the side information that the network 
state is symmetric and is parameterized by three parameters K, n and a — m/n. 

Proof: We consider the two cases in the statement of the theorem separately as follows. 
1) a > 2: Each transmitter knows that it is very strong interference channel in one round and hence 
the transmitters transmit at full rate. Further, the receivers (except the first) also know that it is very 
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Strong interference and are able to decode both the messages. The first receiver decodes the direct 

message assuming it is being sent at full rate. Thus, the sum capacity is achieved. 
2) a < 1/2: With 1 round of message passing, each node (including the receivers) knows if it is 
the top-most user or not. The top most transmitter sends at a rate of n. However, all the other 
transmitters know that there is interference at its receiver and hence backs off to send at the top 
{n — m) levels. Thus, the sum rate of n + {K — l)(n — m) is achieved. Let Vi be the interference 
produced by user i to user i + 1. Then, Yl!i=i H{Yi\Vi) + H(Yk) is an outer bound to the sum 
capacity and this is upper bounded by n + [K — l){n — m). Thus, the sum capacity is achieved. 

■ 

In Theorem 14, we proved that for a > 2 or a < 1/2, one round was sufficient for the existence of a 
universally optimal strategy. Further, one round is the minimum needed for the transmitters to know the 
rate at which they need to transmit. Note that the top receiver Di did not need to know a to decode since 
the transmitter sends at a rate of n which is known to the receiver. These regimes cover the very strong 
and the very weak interference. We next consider the case when the interference is strong but not very 
strong, and when the interference is weak but not very weak. In this case, we give a strategy that uses 
0{K) rounds to achieve the sum capacity. This achievable strategy uses 0{K) rounds because the nodes 
base their decisions on their placement in the network. More precisely, the nodes learn if they are the 
even-numbered nodes or the odd-numbered nodes in the network state to decide on the rate allocation. 

Theorem 15. 1) For 1/2 < a < 2/3, there exists a strategy (in terms of sum-capacity) which is optimal 
with one round of message passing for K <2> and with 2 rounds of message passing protocol for 
any K >?>. 

2) For 2/3 < a < 1, the sum-rate of n + [K — l){n — m) can be achieved with 1 round of message 
passing, which is optimal for K < 3. In general, the sum-rate of n + (K — l)(n — m) + (2m — 
''^)^i=i^^^'^^ i^K>2i+i) + (n — 'm)Yl\=i'^^^^^ {^K>2i+2) can be achieved in d.5 rounds. Thus, this 
strategy achieves optimal sum-rate for K > 3 in K.5 rounds. 

3) Let 1 < a < 2, then the sum-rate of n + (K — l)(m — n) + {2n — m) Y^\=i^^^'^\^K>2i+i) can be 
achieved in d > 1 rounds. This strategy is optimal with one round of message passing for K < 3. 
Further, this strategy is optimal for any odd K > 3 with K rounds, and for any even K, K > A 
with {K — 1) rounds. 

Proof: We divide the proof into four cases as below. 

1) l/2<Q;<l,(i=l: After one round, each node knows if it is the User 1 or not. The first node 
transmits at a rate of n while all other users send at a rate of n — m avoiding interference. This 
strategy will achieve the sum-capacity with one round of message passing for 7^ < 3. 

2) For 1/2<q;<2/3, (i = 2: Each node knows if there are at least 2 nodes above it after 2 rounds of 
message passing protocol. The top most transmitter (Ti) sends at a rate of n. The transmitter that 
has only one transmitter above it (T2) sends on the top n — m signal levels so that the signal can 
be received interference-free at the receiver. Each transmitter that has at least 2 transmitters above 
transmits at top n — m levels and the bottom 2m — n levels. Doing this, there is no interference at 
any of the users and a sum rate ofn-\-{n — m)-\- {K — 2)m can be achieved. The outer bound of 
Ylf=i H{Yi\Vi) + H{Yk) reduces in this case io n-\- [n — m) + {K — 2)m thereby proving that the 
optimal sum-rate is achieved. 

3) 2/3<Q;<l,(i>l:we first relabel the nodes from bottom to top as j = 1, ■ ■ ■ , i^T. We first 
consider a strategy with full information. Let the odd transmitters Tj with odd i e {3, • • • , K} 
send at top 2m — n levels and bottom n — m levels, while the even transmitters Tj with even 
i e {3, • • • , K} send on all the top n — m, bottom n — m levels but repeat the information of 
top min(n — m, 3m — 2n) levels that clashed with the information from transmitter Tj_i on levels 
n ~ m + 1 ton — m+l + min(n — m, 3m — 2n). An example for a = 3/4 is shown in Figure 
7. Transmitter T2 sends at bottom n — m levels and transmitter Ti sends at all the n levels. This 



27 



Strategy will achieve the sum capacity which can be shown as follows. If K is even, for all odd 
i E {1, ■ ■ ■ ,K}, Ri + is upper bounded hy 2n — m due to the Z-channel constraints and 
since our scheme achieves all these outer bounds, the sum capacity is achieved. If K is odd, for 
all even z G {4, ■ ■ ■ , K}, Ri + Ri^i is outer bounded hy 2n — m due to Z-channel constraint and 
Ri + R2 + R3 is outer bounded by 2n due to double Z-channel. Since all these inequalities are 
satisfied with equality with our achievable scheme, the sum capacity is achieved. 




Fig. 7. An example with m = 3, n = 4, a = 3/4. Let there be > 6 users and we consider users 4,5 and 6. 

The above strategy assumed the knowledge of the odd or the even numbering of the users to choose 
the strategy. With no information of the node being even or odd, each transmitter uses the strategy 
to avoid interference (send at n — m levels). However, when the node gets to know that it is odd/even 
numbered, it sends at the strategy of the odd/even node above. This needs larger number of rounds in 
this case. The third, fourth, fifth, sixth, ■ ■ ■ transmitters get to know that they are odd/even numbered 
and K > 3 after 3.5, 4.5, 5.5, 6.5, ■ ■ ■ iterations respectively thus proving the result. 
4) 1 < a < 2: We again use the above relabeling of the nodes from top to below as i = 1, - ■ ■ , K. 
We first consider a strategy with full information. Let the even transmitters among Tj with even 
i G {1, ■ ■ ■ , K} send at top m — n levels, while the odd-numbered users T, with odd z G {1, ■ ■ ■ , K} 
send on all the top n levels but repeat the information of min(m — n,2n — m) levels that clashed 
at receiver Dj+i on the lower levels (See Figure 8 for example). 

The strategy will achieve the sum-capacity which can be shown as follows. If K is even, then an 
outer bound can be given by genie-aided Z-channels as Ri + Ri^i < m for all 1 < i < and thus 
the sum-rate is upper bounded by Km/ 2 and we note that the above strategy achieves this outer 
bound. If K is odd, then for all all odd-numbered i E {I,- ■ ■ ,K — 1}, Ri + Ri^i is upper bounded 
by m and Rk is upper bounded by n by the point to point channel. Further, these conditions are 
satisfied with equality for our rate allocation. Thus, the sum capacity is achieved. 




Fig. 8. An example with m = 3, n = 2, a = 3/2. Let there be 5 users and we consider the top two users. 



The above strategy assumed the knowledge of the odd or the even numbering of the transmitters to 
choose the strategy. With no information of the node being even or odd-numbered, each transmitter 
uses the strategy to avoid interference (send at m — n levels). However, when the node gets to know 
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that it is odd-numbered, it users the strategy of the odd-numbered node above. This needs larger 

number of rounds in this case. The first, third, fifth, seventh, ■ ■ ■ transmitters get to know that they 
are odd-numbered and K > 3 after 1, 3, 5, 7, ■ ■ ■ iterations respectively and the nodes know that 
there are at- least two transmitters below it in 2 rounds, thus proving the result. 
Also note that the requirement of the number of rounds for X = 3, 4 can be reduced by 1 and for 
K > 5 can be further reduced by 2 by changing the strategy so that the bottom most node always 
send at rate of n and the second node from bottom sends at rate m — n. Thus, the sum capacity 
can be achieved with K — 2 and K — rounds for odd and even K respectively for K > A. Using 
similar counting of nodes from below also, the number of rounds needed can be further reduced, but 
still needing 0{K) rounds so that transmitters get to know even/odd placement information from 
the bottom or the top. 

■ 

For 1/2<q;<2/3, 2 rounds of message passing are enough since it is enough for the transmitters to 
learn if there are more than two nodes above it. However for 2/3 < a < 2, 0{K) rounds were needed to 
converge to the sum capacity in general so that the transmitters are able to know their placement in the 
channel connectivity. More precisely, the nodes learnt their relative position in the network being even/odd 
user. This requires 0{K) rounds of message passing. We note that this scheme involved the use of public 
as well as a private message at the transmitters. 

C. Gaussian Z-network 

We now turn our attention to Gaussian channel model. The sum capacity for very weak and very strong 

interference can be achieved with one round of message passing. However, it is easy to show that one 
round of message passing does not suffice for strong but not very strong interference. This is because 
after first round of message passing, the users 2 to — 1 have same information and have to send at 
same rate. If this rate has to be optimal for any K, this rate must be | log(l -|- INR -|- SNR) which limits 
the rate of the first and the K^'^ user also to | log(l + INR + SNR). Tlus sum rate will not be optimal if 
K is odd. Hence, the optimal strategy needs also to consider if K is even or odd and where each node 
is placed in the network even with knowledge of SNR, INR. 

Theorem 16. Suppose that each node knows the side information that the network is parameterized by 
three parameters i^, SNR and INR but do not know its relative placement in the network. There exists an 
optimal strategy (achieving sum rate as with global network state information at all the nodes) with one 
round of message passing protocol for any K when one o/INR > SNR(SNR-l-l) or INR(INR-l-l) < SNR 
is satisfied. For any INR and SNR, there exists an optimal strategy with one round of message passing 
forK < 3. For SNR < INR < SNR(SNR + 1), the sum rate o/ f log(l + INR + SNR) + i(log(l + SNR) - 
log(l -|- K=odd,d>K can be achieved for d > 1 round of message passing. This strategy is optimal 

for any even K with 1.5 round of message passing while for odd K with K rounds of message passing. 

Proof: We divide the claims of the theorem into four parts as follows. 

1) INR>SNR(SNR-|-1): Each transmitter knows that it is very strong interference in one round and 

hence the transmitters transmit at a rate of log(l + SNR). Further, all the receivers except the first 
also know that it is very strong interference and are able to decode the messages. We also assume 
in all our achievability techniques that the first receiver will assume that a rate of log(l + SNR) is 
being used by Ti and thus Di will be able to decode. Thus, the sum-capacity of K\og{l -\- SNR) 
is achieved. 

2) INR(INR+ 1) < SNR: With one round of message passing, each node knows if it is the top-most user 
or not. The top-most transmitter sends at a rate of log(l + SNR). However, all the other transmitters 
know that there is one transmitter that interferences at the receiver and hence backs off to send at 
a rate of log(l + ^^^). Thus, the sum rate of log(l + SNR) + (X - 1) log(l + j^) is achieved. 
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This can be shown to be an outer bound on the sum capacity using the same technique for the 

double-Z channel, generalizing the proof of Equation 26 in Appendix D. 

3) INR<SNR, rf=l: Each transmitter knows if it is the top-most transmitter. The top-most transmitter 
sends at a rate of log(l + SNR) while all others send at a rate of log(l + j^^) avoiding interference 
and thus the sum-capacity is achieved for K < 3. 

4) SNR < INR < SNR(SNR + 1), d > 1: In this case, the strategy for deterministic can also be 
extended by changing rates of m — n by log(l + -^|g^p ) and n by log(l + SNR) to get similar result 
as in deterministic model. However, we here consider an alternate strategy. Let, after the 1.5 rounds 
of message passing, each node transmits at a rate of | log(l -I- INR -|- SNR) (0.5 rounds is added 
because the receiver Di needs to know the range of INR to be able to decode). However, when all 
the nodes know the whole state which is after K rounds, they change the strategy to that as in the 
deterministic case if K is odd. This can also be proved to be optimal using a similar proof as in 
the deterministic case. 

■ 

In the very strong interference, the transmitted message is all public and hence the decoders can 
decode both the messages since they know the rates. In the strong but not very strong interference, the 
consideration to even/odd number of users was important due to which the number of rounds of message 
passing increased to K for odd K. Suppose that the nodes do not use node identity in the choice of 
strategies, but only the relative position known through the local information. The intuition in the use 
of 0(K) rounds of message passing is that if constant number of neighboring nodes (let us say 10, 
numbered 2Ki + !,■•■ , 2Ki + 10) know the exact similar structure (which would happen with less than 
0(K) rounds), they choose the same rate. If this constant rate is not ^log(l + SNR + INR), the sum 
rate will not be optimal. Further, pick out even number of users (say 2Ki) from above and the sum 
rate of these top nodes has to be ii'ilog(l -|- SNR + INR), otherwise it won't lead to global optimality. 
Similarly, choose 2K2 users from below such that K — 2K2 = 2Ki + 10 or 2Ki + 11 depending on K 
being odd/even. The sum rate for these chosen nodes from below has to be i^2log(l -I- SNR -h INR) for 
optimality. Thus, if remaining nodes (K — 2(Ki + K2)) are odd, the sum rate is not optimal. Thus, there 
cannot exist an optimal rate allocation with less than 0(K) rounds of message passing. In the very weak 
interference, signal of a transmitter is an interference for the other receivers (to which it is connected). In 
the remaining region of weak but not very weak interference (j|^ < INR < SNR), the signals need to 
be divided into public and private messages and is not considered in this paper for the Gaussian channel 
model. 

VII. Conclusions 

Almost all networks operate with partial network information at different nodes, requiring nodes to make 
distributed decisions. While a rich literature exists on design of network protocols and their analysis, there 
is no prior work to understand the impact of distributed decisions on Shannon-theoretic capacity region. In 
this paper, we laid foundation to characterize partial network information and studied the impact in several 
network connectivities. Seeking universal optimality, where local decisions with certain side information 
are always globally optimal, we discovered that there appears to be a critical minimum information 
required for the network to allow globally optimal decisions. Our current approach is compound capacity 
based and our next step is to understand impact of partial information on fading interference channels. 

Appendix A 
Proof of Theorem 2 

We note that at the end of first round, the first transmitter knows all the channel gains while the second 
transmitter does not know one of the channel gain riu. Let the strategy that the transmitter uses be: 
1) Transmitter 2, which is not producing interference, sends at it maximum possible rate of 7122- 
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2) Transmitter 1 assumes that transmitter 2 is sending at full rate. Thus, transmitter 1 sends at a rate 
of (^11 — ni2)^ if ni2 < n22, thus not sending on any link that produces interference. However, it 
sends at a rate of min(max(nii, 7212) — ^225^^11) if ^^12 > ^^22^ transmitting at the non-interfering 
links to the signal of the second transmitter communicating at the rate n22- 

We now show that this strategy can achieve the sum rate as follows. 

If ni2 < ^22, the sum rate in (7a) simplifies as 



R1 + R2 < (nil - ^12)'^ + ri22. 



(44) 



Hence, the first transmitter will send at a rate of {nn — ^12)^. Further, since the first transmitter knows 
nil and ^12, it can send data on the links at which it is not generating any interference and thus achieve 
the sum capacity. 



Ti 



(niV- ni2) 



— C 



Di 



n-22 



-V«-C 



^D2 



Fig. 9. In this case, ni2 < 7122- The bold hnes show the active bits. 




Fig. 10. In this case, ni2 > 7122. The bold lines show the active bits. 

Now consider the case when ni2 > ^22, in which case the sum rate in (7a) simplifies as 

R1 + R2 < min(max(nii,ni2),nii + ?222)- 



(45) 



Thus, the first transmitter sends at a rate of min(max(raii, 7112) — ^22, ^n) = {nn — 7212)^ + min(ni2 — 
^22, ^ii). This is achieved by sending along the dimensions that can not even be heard at the receiver 2 
which are (nn — ^12)^ in number. In addition, among the dimensions that can be heard by the second 
receiver, the data is sent along the dimensions which do not produce an interference to the direct signal. 
These are min(ni2 — ^22, ^n) in number; see Figure 10. The extra half round is required for the receiver 
to learn the strategy used by the transmitters. Thus, the maximum sum rate can be achieved with 1.5 
rounds. 



Appendix B 
Proof of Theorem 5 

A. The rates mentioned in (12) -(16) can be decoded at the receivers 

In this subsection, we will show that the rates mentioned in (12)-(16) can be decoded at the receivers. 
In order to see the decoding process at the two users, consider the following scenarios. 

1) INR2 < SNR2. The first receiver will be able to decode the data as the rate is supported by the 
power sent. The second receiver treats the first user's signal as interference. The interference power 
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is INR2/(1 + INR2). Thus, the second receiver is able to decode its data treating this power as 

interference. 

2) INR2 > SNR2, INR2 > SNRi. Since the received power from the signal of the first transmitter is 
SNRi and rate < log(l + SNR1), the first receiver is able to decode. At the second receiver, the two 
signals are decoded jointly. We have to verify: 

i?i <log(l + INR2) (46a) 
R2 < log(l + SNR2) (46b) 
Ri + R2< log(l + INR2 + SNR2) (46c) 

The first two hold. For the third, R1 + R2 < log (1 + j^^^U, ] +log(l+SNR2(l + INR2)/(l + 

2INR2)) = log(l + INR2 + SNR2). Hence proved. 

3) INR2 > SNR2, INR2 < SNRi. In this case, there is both a public and private message from the first 
user. At the first receiver, we need to decode the public data treating the other as noise. Further, the 
private data can be decoded. To check the first, we need to see 

, / SNRilNR2/(l + INR2)\ 
^■-'^'°H'^ l + SNR./(l + INRj) j 



Thus, 



\ml SNR1INR2 

< — ^ — - (48) 



1 + 2INR2 + SNR2(1 + INR2) - 1 + INR2 + SNRi 



It is enough to prove i+2inr2+'sS|2(i+inr2) ^ i+inSinr^ ^^^^^ INR2 < SNRi will prove the rest. 
The first part trivially holds too. 

At the second receiver, we need to show that we can jointly decode the common message of user 
1 and the data of user 2 treating the private message from the first user as noise. For this, we need 
the following: 

, / INR^/(1 + INR2) \ 

^ '°«0+ i + inr'/(i + inrJ (^'"^ 

n / S N R2 + I N R^/ ( 1 + I N R2) \ 

We note that all these three conditions are satisfied. 

B. Difference between achievability and outer bound 

To show that the achievable sum rate is within 2 bits of the outer bound, we consider the following 
regimes. 

1) INR2 < SNR2, INR2 < SNRi. 
Achievable sum rate 

Rac = log(l + SNRi/(l + INR2)) + log(l + SNR2(1 + INR2)/(1 + 2INR2)) (50) 
Outer bound on sum rate: 

R^ = log(l + SNRi) + log (1 + ^^^) (51) 
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Rco-Rac = log(l + SNRi)+log(l + INR2 + SNR2) -log(l + INR2 + SNRi) 

- log(l + SNR2(1 + INR2)/(1 + 2INR2)) (52a) 

< log(l + INR2 + SNR2) -log(l + SNR2(l + INR2)/(l + 2INR2)) (52b) 

< log(l + 2SNR2) -log(l + SNR2/2) (52c) 

< 2 (52d) 

2) INR2 < SNR2, INR2 > SNRi. 
Achievable sum rate 

Rac = log(l + SNRi/(l + INR2)) + log(l + SNR2(1 + INR2)/(1 + 2INR2)) (53) 
Outer bound on sum rate 

i?eo = log(l + SNR2 + INR2) (54) 



Rco-Rac = log(l + INR2 + SNR2) -log(l + INR2 + SNRi)+log(l + INR2) 

- log(l + SNR2(1 + INR2)/(1 + 2INR2)) (55a) 

< log(l + INR2 + SNR2) -log(l + SNR2(l + INR2)/(l + 2INR2)) (55b) 

< log(l + INR2 + SNR2) -log(l + SNR2/2) (55c) 

< log(l + INR2 + SNR2) -log(l + (SNR2 + INR2)/4) (55d) 

< 2 (55e) 

3) INR2 > SNR2, INR2 > SNRi. 
Achievable sum rate 

Rac = log I 1 + min I SNRi, , , 'ZZmr, ' 1 1 + log(l + SNR2(1 + INR2)/(1 + 2INR2)) (56) 



' 1+SNR2+21NR2 
Outer bound on sum rate 




i?eo = log(l + SNR2 + INR2) (57) 



If — — 1ni^^nr2 — ^ SNRi, achievability matches the outer bound. 
INR2 

SNR2INR2 



1+SIMR2+2HMR2 

However, if > SNRi, consider Rco = log(l + SNRi) + log(l + SNR2) to show that 

1+ 1 



I+SNR2+2INR2 

4) INR2 > SNR2, INR2 < SNRi. 
Achievable sum rate 

/ INR^ \ 

- + 1 + 2INR2 + SNR2(1 + ■NR2) ) + ^"g^^ + ^^^-/^^ + '^^-^^ 

+ log(l + SNR2(1 + INR2)/(1 + 2INR2)) (58) 

Outer bound on sum rate: 

R,o = log(l + SNRi) + log (l + ^^^) (59) 
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Note that 



+ log(l + SNR2(1 + INR2)/(1 + 2INR2)) (60a) 
> log f 1 + + log(l + SNR./d + INR.)) (60b) 



Thus, 



1 + INR2/(1 + INR2) 

>- - iSarNR, ) + + SNR,/(1 + INR,)) (600) 

(|Mp2 \ 

1 + i^2\mj + ^""^^^ + ^^^'/^^ + '^^'^^ ^^^^^ 

> log(l + 2INR2 + \ml) - log(l + 2INR2) + log(l + SNRi) 
-log(l + INR2) (60e) 

= log(l + INR2)-log(l + 2INR2)+log(l + SNRi) (60f) 

> log(l + SNRi) - 1 (60g) 

Rco-Rac < log(l + SNRi)+log (1+ i^^inrJ -log(l + SNRi) + l (61a) 

< 2 (61c) 

Appendix C 

a class of three-user deterministic channels 

In this Appendix, we will introduce a class of deterministic double Z channels and find a capacity 
region for this class of channels. 

We consider a class of deterministic discrete memoryless IPC's in which the outputs Yi, Y2, and Y3, 

and the interferences Vi, and V2 are (deterministic) functions of the inputs Xi, X2 and Xs as follows: 

Y, = /i(Xi) (62a) 

Y2 = /2(X2,V1) (62b) 

Ys = f3{X,,V2) (62c) 

Vi = gi{X,) (62d) 

V2 = 92iX2), (62e) 

where /i(.), /2(.), /3(-)' ^i(-) and g2{.) are deterministic functions. Further, let /i(.), /2(.) and /3(.) 
satisfy 

H{Yi\Xi) = (63a) 

H{Y2\X2) = H{Vi) (63b) 

H{Ys\Xs) = H{V2). (63c) 
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Theorem 17. The capacity region for the class of deterministic double Z channels is given by 



Ri < H(Yi) (64a) 

R2 < H{Y2\V^) (64b) 

R3 < H{Y3\V2) (64c) 

R1 + R2 < H{Yi\Vi) + H{Y2) (64d) 

R2 + R3 < H{Y2\ViV2) + H{Y3) (64e) 

Ri + R2 + R3 < H{Y^\Vi) + H{Y2\V2) + H{Y3) (64f) 

The remaining part of the section is devoted to the proof of this Theorem. 



A. Han-Kobayashi Region for a general DMC 

The achievability of the above theorem will follow by specializing the Han-Kobayashi region to the 
class of deterministic channels. In this subsection, we will describe the Han-Kobayashi region for a general 
discrete memoryless channel. Let Un, U12, U22, U23 and U3 be the auxiliary variables and Q be the time 
sharing auxiliary variable satisfying the following conditions 

1) Uii, U12, U22, U23 and U3 are conditionally independent given Q. 

2) = /i(C/n, UM, X2 = /2(C/22, C/23IQ) and X3 = /alC/alQ). 

3) Pr{yi = yi,Y2 = ^2,5^3 = Vsl^i = Xi,X2 = X2,X3 = X3} = w{yi,y2,y3\xi,X2,X3) 

Theorem 18. Under the above constraints, the Han-Kobayashi rate region is given by {Ru + R12, R22 + 
R23, R3), where Ru, R12, R22, R23, R3 satisfy following set of inequalities: 



Ru 


< 




(65a) 


R12 


< 


h 


(65b) 


Ru + R12 


< 


Cl 


(65c) 


R12 


< 


di 


(65d) 


R22 


< 


ei 


(65e) 


R23 


< 


/i 


(65f) 


R12 + R22 


< 


91 


(65g) 


R12 + -R23 


< 


hi 


(65h) 


R22 + R23 


< 


k 


(65i) 


R12 + R22 + -R23 


< 


ji 


(65j) 


-R23 + R3 


< 




(65k) 


R23 


< 


h 


(651) 


R3 


< 


nil 


(65m) 


—Ru 


< 





(65n) 


—R12 


< 





(65o) 


—R22 


< 





(65p) 


—R23 


< 





(65q) 


-R3 


< 


0, 


(65r) 
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ai 


— J l-f 1 


TL, iTLr. n") 




Ol 


— T(V. 


'^12|'^ll5 Vj 


) 


Cl 


— T(V. 






"1 


— T(V^ 

— J \J-2 


(^12|(^22, <^23) 


\\J\3\X) 




— T(V^ 

— J \J-2 


(^22|(^12, (^23, 




/i 


— T(V„ 


<^23|<^12) 1722) ) 




yi 


— T(V^ 

— J- 


<^12) <^22|<^23) 




All 


— J \J-2 


(^12, L'23|<^22, 






— T(V 


7"r 7"r IT"/" D\ 
1^22, <^23|<^12, 




Ji 




C/l2,C/22,C/23|Q) 


(66j) 






U23,Us\Q) 


(66k) 


/i 




U2S\U3,Q) 


(661) 


mi 


= I{Y3 


U3\U23,Q) 


(66m) 



Proof: The proof is an easy extension of [14] and is therefore omitted. ■ 
Theorem 19. The above rate region can be simplified to 



Ri 


< 


min(ci, fli + di) 


(67a) 


R2 


< 


min(ii, d + h) 


(67b) 




< 


mi 


(67c) 


Ri + R2 


< 


ai + min(ji,5fi + h) 


(67d) 


R2 + R3 


< 


ki + ei 


(67e) 


2i?i + R2 


< 


2ai + Qi + hi 


(67f) 


Ri + R2 + ^3 


< 


oi + 51 + ki 


(67g) 


-Ri 


< 





(67h) 


-R2 


< 





(67i) 


-R3 


< 





(67j) 



Proof: The proof follows by repeated use of Fourier Motzkin Elimination Algorithm as described 
below. In Equations (65a)-(65r), replace i?ii by Ri — R12. Further, we separate the terms not containing 
R12, containing R12 with positive sign on the left of < and containing negative sign on the left of < as 
follows. 



36 



Terms not containing R12: 

(68a) 
(68b) 
(68c) 
(68d) 
(68e) 
(68f) 
(68g) 
(68h) 
(68i) 
(68j) 

(69a) 
(69b) 
(69c) 
(69d) 
(69e) 

(70a) 
(70b) 

Hence, the Han-Kobayashi rate region is equivalently given by Equations (68a)-(70b). R12 can be elimi- 
nated from Equations (69a)-(70b) using Fourier Motzkin Elimination technique to get the following 

(71a) 
(71b) 
(71c) 
(7 Id) 
(71e) 
(7 If) 
(71g) 
(71h) 

The rate region is now equivalently given by Equations (68a)-(68j), (71a)-(71h). In these equations, 
substitute R22 = R2 — ^23 and separate the terms not containing R2S, containing R23 with positive sign 
on the left hand side of <, and the terms containing negative sign on the left of < as shown below. 



Ri 




Cl 


D 

-K22 




ei 


D 

-n.23 




h 


ID 1 TD 

K22 + ^23 




h 


ID 1 ID 

K22 + -ft3 






-^23 


< 


•'1 


i?3 


< 


mi 


—R22 


< 





—R23 


< 





-Rs 


< 





positive sign: 






R\2 


< 


mill 


R12 + R22 


< 


9i 


R12 + R23 


< 


hi 


R12 + -R22 + R23 


< 


h 


R12 — Ri 


< 





negative sign: 






Ri — R12 


< 




—R12 


< 








Ri 


< 


ai + min(6i, di) 


Ri 


+ R22 


< 


ai + gi 


Ri 


+ R23 


< 


ai + hi 




R22 


< 


9i 




R23 


< 


hi 


Ri + R22 


+ -R23 


< 


ai + ji 


R22 


+ -R23 


< 


ji 




-Ri 


< 
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Terms not containing it!23: 

Ri < min(ci, cii + bi, a-i + di) (V2a) 

i?2 < min(ii,ji) = ii (72b) 

R1 + R2 < ai+ji (72c) 

-Ri < (72d) 

R3 < mi (72e) 

-R3 < (72f) 

Terms containing R23 with a positive sign: 

i?23 < min(/i,/ii,/i) (73a) 

i?3 + i?23 < h (73b) 

-R2 + R23 < (73c) 

R1 + R23 < ai + /ii (73d) 

Terms containing i?23 with a negative sign: 

R2-R23 < min(ei,5ii) = ei (74a) 
Ri + R2-R23 < ai + gi (74b) 
-R23 < (74c) 

The rate region is now equivalently given by Equations (72a)-(74c). EUminating R23 from Equations 
(73a)-(74c) using Fourier Motzkin EUmination, we get 

l) (75a) 

(75b) 
(75c) 
(75d) 
(75e) 
(75f) 
(75g) 
(75h) 
(75i) 
(75j) 





R2 


< 


min(ei + /i, ei + /ii, ei 


Ri 


+ R2 


< 


ai + gi + mm{fi,hi,li) 


Ri 


+ R2 


< 


ei + oi + hi 


2Ri 


+ R2 


< 


ai + gi + ai + hi 




Ri 


< 


oi + hi 


R2 


+ R3 


< 


ki + ei 


Ri + R2 


+ R3 


< 


ai + gi + ki 




R3 


< 


ki 




Ri 


< 


«! +5'l 




-R2 


< 





38 



The rate region is now given by Equations (72a)-(72f), (75a)-(75j). Combining these terms, the rate region 



can be described as follows. 

Ri < min(ci, ai + 61, ai + di, ai + /ii, ai + gii) (76a) 

R2 < min(ii, ei + /i, ei + /ii, ei + Zi) (76b) 

< min(/ci,mi) (76c) 

R1 + R2 < mm{ai+ji,ai + fi + gi,ai + gi + hi,ai + ei + hi,ai + gi + li) (76d) 

R2 + R3 < h + ei (76e) 

2R1 + R2 < 2ai + gi + hi (76f) 

R1 + R2 + R2 < ai + gi + ki (76g) 

-Ri < (76h) 

-R2 < (76i) 

-R3 < (76j) 



Since Ci < ai + 61, rfi < hi and di < gi, the Ri equation is same as in the statement of the Theorem. 
Since ii < Ci + /i < ei + hi, the R2 equation is same as in the statement of the Theorem. Further, 
TTT'i < ki gives Rs same as in the statement of the Theorem. As ji < fi+ gi, ji < ei + hi and fi < hi, 
Ri + R2 bound is also same as in the statement of the Theorem. The rest of the statements directly follow. 



B. Specializing Han-Kobayashi Achievability for model in Theorem 17 

To show the achievability, we specialize the Han-Kobayashi Rate region by taking the time- sharing 
variable as trivial, and using Uu = Vi, U23 = V2, X2 = hi{U22, U23), Xi = h2{Uii,Ui2), U22 = 5'3(^2), 
Uii = g4{Xi) and U3 = X3 for some deterministic functions hi, /12, g3 and g^. With these substitutions, 
we now show that the Han-Kobayashi rate region reduces to that in the statement of the Theorem 17. 

We consider all the equations in Han-Kobayashi region one by one. 

From (67), i?i < min(ci, ai+di). Here, ci = /(Fi;Xi) = H{Yi) andai+di = H{Yi\Vi)+H{Y2\X2) = 
H(Yi, Vi). Thus, the above reduces to Ri < H(Yi) which is same as in the statement of the Theorem 17. 

From (67), R2 < min(ii,ei + h). Here, ii = H{Y2\Vi) and ei + h ^ H{Y2\ViV2) + H{Ys\Xs) = 
H{Y2\ViV2) + H{V2) > hIy2,V2\Vi) > H{Y2\Vi). Thus, this bound also reduces to the same as in the 
statement of the Theorem 17. 

From (67), R3 < rrii = H{Y3\V2). 

From (67), R1 + R2 < ai + mm{ji, gi + h). Here, ji = H{Y2) < //(Fs, ^2) = H{Y2\V2) + HiY^lX^) = 
gi + h. Thus, this reduces to Ri + R2 < H{Yi\Vi) + H{Y2) as in the statement of the Theorem 17. 

From (67), R2 -\- R3 < ki -\- ti — H{Y2) + -^^(^2! 141^2) is the same as in the statement of the Theorem 
17. 

From (67), 2Ri + R2 < 2ai + gi + hi = 2H{Yi\Vi) + H{Y2\V2) + H{Y2\U22). We will now show that 
this constraint is looser than the sum of the constraints on Ri and Ri + R2, which proves that this is not 



a limiting condition on the rate region. 

2H{Yi\Vi) + H{Y2\V2) + H{Y2\U22) (77a) 

= 2H{Yi\Vi) + H{Y2\V2) + H{Y2\U22) - H{Y2\U22V2) + H{Y2\U22V2) (77b) 

= 2H{Yi\Vi) + I{Y2] U22\V2) + H{Y2\U22) + H{Y2\X2) (77c) 

2H{Yi\Vi) + /(Fa; U22) + ^^(>2|^22) + H{Vi) {lid) 

= H{Yi,Vi) + H{Y2) + H{Yi\Vi) (11c) 

> H{Yi) + H{Y2) + H{Yi\Vi), (llf) 
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where (a) follows since V2 and U22 are independent. Thus, the bound of 2Ri + R2 is redundant. 

From (67), Ri + R2 + R3 < ai + gi + ki = H{Yi\Vi) + H{Y2\V2) + H{Y.,) which proves the last 
condition in Theorem 17. This completes the proof of the achievability of the rate region. 

C. Converse for Theorem 17 

The individual bounds on R2, R3 and Ri + R2 follow the same steps as in [12] and are omitted. 
For R2 + R3, 

n{R2 + R3) < I{X'^';Y^') + I{Xl,Y^')+ne (78a) 

< I{X^^^-Y,^\V,n + I{X^;Y,^)+ne (78b) 

< I{X^;V,-Y,-\Vr) + H{Y,-)-H{YJ^\X;^)+ne (78c) 
= I{Xl, V.^lVl') + I{X^; Y^\V^, V^) + - H{V^) + ne (78d) 

i E{y:^)^E{y:^\V:^,V^) + E{Y^)-E{y^)+ne (78e) 
= E{Y:^\V:^,V^)^E{Y^)^ne, (78f) 

where (b) follows since E{^2) < H{Y2,Vi) implies E{Y2) < E{Vi) + //(Fal^i) = H{Y2\X2) + 
E{Y2\Vi) = E{Y2\X2) + E{Y2\Vi) - E{Y2\X2,Vi) , and (c) follows since Vi and V2 are independent. 
Taking n ^ 00 and by the convexity properties of the region, the bound in the statement of Theorem 17 
is obtained. 

For the bound on Ri + R2 + R3, 

n{Ri + i?2 + R3) 

< /(Xf; Y;') + /(X2"; + /(^3 ; + ne (79a) 
= /7(Fi") + EiY^) - Eiyi') + E{Y^) - E{V^) + ne (79b) 

< e{y;') - E{yi') + E{yi'\Yi') + h{y^') - Eiy^) + E{y:^\Y;') + e{y;') + ne (79c) 

= E{Y^\V^^) + E{Y;'\V^) + E{Y^)+ne (79d) 

Taking n — > cxo and by the convexity properties of the region, the bound in the statement of Theorem 17 
is obtained. 

Appendix D 

Outer Bounds for Gaussian 3-user double Z-channel 

In this Section, we prove the sum rate bounds in the second and the fourth cases, more precisely. 
Equations (22)-(23), (26)-(27). 

A. INR2 > SNRi and INR3 < SNR2 

Note that there is a strong interference between first two users. Hence both Xi and X2 can be decoded 
from Y2. Thus, 

n(i?i + i?2 + i?3) < I{X'l,X^;Y^) + I{Xl,Y^)+ne (80) 

Note that Y2 = ^INR^Xi + ^SNR^Xs + Z2 and F3 = ^INR^Xs + ^SNR^Xg + Z3. Let S = x/INR^Xs + V 
where V is zero mean unit variance complex Gaussian random variable independent of Xi, X2 and X3 
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(can depend on Z's though). By providing genie to the second receiver, 

n{R, + R2 + Rs) < I{X^,X^-Y,^,S^) + I{Xl,Y^)+ne (81a) 

= I{X^,X^;Y,-,S'^) + h{Y,-)-h{S-)+ne (81b) 

= I{X^,X^;S'') + I{X^,X^;Y^\S'') + hiY^"") - hiS"") +ne (81c) 
= - /i(l/") + /i(y2''|'5") - hiZ^lV) + hiY^"") 

-hiS"") + ne (81d) 

= -hiV) + hiY^lS"") - hiZ^lV) + h{Y^) + ne (81e) 

< -hiV") + nh{Y2,G\SG) - hiZ^lV") + nh{Ys,G) + ne (81f) 

The last step follows since Gaussian maximizes conditional entropy as well as marginal entropy [15]. 
Further, choosing e arbitrarily small, 

{R, + R, + Rs) < -h{V) + h{Y,,G\SG) - h{Z2\V) + h{Y,,G) (82a) 

= h{Ys,G) - HSg) + h{Y2,G\SG) + HSg) - h{Z2,V) (82b) 

= IiY,,G;Xs,G) + hiY2,G,SG)-h{Z2,V) (82c) 

The last step followed by choosing V independent of X3 g. 

{Ri + R2 + R3) < IiY3,G;X,,G) + hiY2,G,SG)-h{Z2,V) (83a) 

< I{Ys,G;X3,G) + HY2,G,SG)-h{Z2,V\X,,GX2,G) (83b) 

= I{Ys,g;Xs,g) + I{X,,gX2,g;Y2,g,Sg) (83c) 



For (INR2 + 1)INR3 < SNR2, we can choose V = y ^^sm^' ^ho + a/ + where Z„ is 
additional random variable independent of all other random variables, complex normal to make variance 
of V unity. With this choice, /(-^i,g-^2,g; ^2,g, 'S'g) — -^(-^i,g-^2,g; ^2,g)- Thus, the above formula can 
be obtained. 



For (INR2 + 1)INR3 > SNR2, we can choose V = c(^^|^Xi,g + -^§^^2) where c < 1 is chosen 
to make the above unit variance. With this choice, 

-^(-'^1,G-^2,g; ^2,G) Sg) 

= Y2,g, Sg\X2,g) + I{X2,g; Y2,g, Sg) (84a) 

= /(Xi,G;r2,G|^2,G) +/(^1,G;^g|X2,g1^2,g) +/(X2,G;1^2,G,^g) (84b) 

For the second term I{Xi g', 5'g|-^2 g^2 g)> Sg is deterministic function of X2 g and Y2 g and this term is 
therefore 0. The third term /(X2,g; Y2,g',Sg) = /(^2,g; ^g) +/(X2,g; V'2,g|^g)'. But, /(X2,g; r2,G|^G) = 
as X2,G — Sg — 5^2,g is a Markov Chain. Thus, 

J(Xi,gX2,g;1^2,G,^g) = /(Xi,G;r2,G|X2,G)+/(X2,G;^G) (85) 

This gives the above sum rate. 



B. INR2 < SNRi and INR3 < SNR2 
By Fano's inequality, 

+ i?2 + i?3) < I{X{^;Yr) + I{X^;Y,^) + I{X^;Y,^)+ne (86) 

Note that = y^NR^Xa + y/SNR^ Xa + Z 3, Y2 = v/|NR^Xi + v/SNR^X2 + Z2 and Yi = ^/Sm^Xi + Zi. 
Let 5*1 = yiNR^Xi + Vi and S2 = \/INR^X2 + V2 where Vi and V2 are mutually independent complex 
Gaussians of unit variance and independent of all X^'s. Note that I{X^] Y^) = I{X^- + ^/Sm^X^) 
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since the distributions remain the same 

n{Ri + R2 + R3) < 
< 
< 



+/(X2"; + ySNR^X2"|52") + - hiS2) + ne 

h{Sl') - + h(Y{'\S[') - h{Z['\V{') - h{V^) 

+h{Sl' + ^/Sm2X^\S^) - W') + ^(^3") + ne 
h{S[') + h{Y{'\Sl') - h{Z[\ V[') - /i(\/2") 
+^3"^ + ^/sm^X^lS^) - + hiY^"") + ne 



+ ne 



(87a) 
(87b) 

(87c) 

(87d) 

(87e) 

(87f) 

(87g) 



The last step follows by choosing Vi independent of Z2. Further since conditional and marginal entropies 
are maximized by Gaussians, 



n{R^ + R2 + R3) < hiY,^\S'^)-hiZ'^,V{')-h{V2n 

+ v^SNR^X^'I^^O + + ne 

< nh{Y,G\S,G) - nhiZ,, V,) - nhiV^) 

+nh{Si,G + Vsm2X2,G\S2,G) + nh{Y3^G) + ne 



(88a) 



(88b) 



Letting e 



0, we get 

Ri + R2 + ^3 



< h{Y,G\SiG) - h{Zi,Vi) - h{V2) 



(89a) 



+h{Si,G + VSm2X2,G\S2,G) + h{Ys,G) 

= h{Y,G\SiG) - h{Z,\V,) - h{V,) - h{V2) 

+h{Si,G + VSm2X2,G\S2,G) + h{Ys,G) 

= h{Y,G\S,G) - h{Y^G\SlG,X,,G) - h{V,) - h{V2) 

+h(Si,G + VSm2X2,G\S2,G) + h(Y3,G) 

= I{Y,g; X,,g\Sig) - h{V,) - h{V2) 

+h(Si,G + VSm2X2,G\S2,G) + h(Y3,G) 

= I{Xig; S,g) - HSig) - h{V2) 

+h(S,,G + VSm2X2,G\S2,G) + h{Y3,G) 

The last step follows by choosing Vi independent of Xig- We choose Vi = ■sJ^^Zi + Zia where Zia 
is complex normal independent of all other variables in order to make Vi unit norm. With this choice. 



(89b) 



(89c) 



(89d) 



(89e) 
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Xi,G - Si,G - yi,G becomes a Markov Chain and thus I{Xig; Yi^g, Sig) = /(^ig; ^i,g) = log(l + SNRi). 

R, + R2 + Rs < log{l + Sm,) + h{Y,,G)-h{SiG)-h{V2) 

+h{Si,G + VSNR2X2,g|^2,g) (90a) 

= log(l + SNRi) + h{Y,,G) - h{S,G) - h{V2) - h{S2G) 

+ h{S,,G + VSmX2,G, S2,g) (90b) 

= log(l + SNRi) + h{Ys,G) - HSig) - h{S2G) + h{Vi) - h{Vi) - hiV^) 

+h{Si,G + ^Sm^X^^G, S2,g) (90c) 

< log(l + SNRi) + /i(F3,g) - /i(5iG) - h{S2G) + /i(V^i) - V2\X^,gX2,g) 
+/i(5i,G + a/SNR;X2,g, 52,g) (90d) 

= log(l + SNRi) + h{Y^^G) - KSig) - KS2g) + h{Vi) 

+I{X,^G, ^2,g; 5i,g + Vsm2X2,G, S2,g) (90e) 

Note that the last expression I{Xi^g, X2,g', Si^g + VSNR2X2,g5 'S'2,g) is similar to that in subsection A 
and thus V2 can be chosen as in subsection A. (Si^g + \/SNR^X2,g plays the role of Y'2,g. only difference 
that Z2 is replaced by Vi.) Thus, the same steps give the required bounds on Ri + R2 + R3. 

Appendix E 

Proof that rates can be decoded with 1.5 rounds of message passing 

To show this, we divide the range of INR2, SNR2, INR3 and SNR3 into the following nine cases. 
1) INR2 > SNR2, INR3 > SNR3, INR2 > SNRi, INR3 > SNR2.- The first transmitter makes a codebook 
of rate 

^1 = log (1 + niin ( SNRi, 's^nr'inr. ) ) > (91) 

\ \ 1+SNR2+21NR2 / / 

and uses a power level of 1 to transmit. The second transmitter makes a codebook of rate 

and uses a power level of 1 to transmit. The third transmitter makes a codebook of rate 

i?3 = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) (93) 

and uses a power level of 1 to transmit. 

At the receiver 1, Ri can be decoded since Ri < log(l + SNRi). At the receiver 2, Ri and R2 are 
decoded jointly. We see that 

Ri < log(l + INR2) (94a) 
R2 < log(l + SNR2) (94b) 
R1 + R2 < log(l + INR2 + SNR2) (94c) 

Ri + i?2 equation holds as 

Ri + R2< log (1 + ^, ] + log (1 + i^g;^SNR2) = log(l + INR2 + SNR2). 

V ^+l+SNR2+2INR2 / ^ ^ 
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At the receiver 3, R2 and are decoded jointly. We see that 

R2 < log(l + INR3) (95a) 
R3 < log(l + SNR3) (95b) 
R2 + R3 < log(l + INR3 + SNR3) (95c) 

R2+R3 equation holds as R2+R3 < log ( 1 + -^§55^ ) +log (l + rJ^SI^f^s) = log(l + INR3 + 

SNR3). 

2) INR2 > SNR2, INR3 > SNR3, INR2 > SNRi, INR3 < SNR2; The first transmitter makes a codebook 
of rate 



INR 



R, = log ( 1 + min ( SNRi, . , snr'inr. 1 1 ' (96) 



1+SI\IR2+2INR2 

and uses a power level of 1 to transmit. 

The second transmitter makes two codebooks, the first one of rate 

/ 1 + INR2 mRl \ 

Ri,c - og\^+ 1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) ) ' ^ ^ 

and the second of rate 

R2,v = log (1 + r^^SNR2/(l + INR3)) . (98) 

The transmitter send the first one at a power of INR3/(1 + INR3) and the second one at a power of 
1/(1 + INR3), adds them up and transmit. 
The third transmitter makes a codebook of rate 

i?3 = log(l + SNR3(1 + INR3)/(1 + 2INR3)) (99) 

and uses a power level of 1 to transmit. 

At the receiver 1, Ri can be decoded since Ri < log(l + SNRi). At the receiver 2, Ri, i?2,c and i?2,p 
are decoded jointly. We see that 

Ri < log(l + INR2) (100a) 

R2,c < log(l + SNR2lNR3/(l + INR3)) (100b) 

R2,p < log(l + SNR2/(l + INR3)) (100c) 

Ri + R2,c < log(l + INR2 + SNR2lNR3/(l + INR3)) (lOOd) 

Ri + R2,p < log(l + INR2 + SNR2/(l + INR3)) (lOOe) 

R2,c + R2,p < log(l + SNR2) (lOOf) 

Ri + R2,c + R2,p < log(l + INR2 + SNR2) (lOOg) 
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The first three sub-equations hold trivially. For the fourth, we see that 

n . INR2 ^ , / INR3 

- ' ' ^ + irglS j ^ ^ 1 + 2INR3 + SNR3(1 + INR3) 1 ''''' 

INR2 \ml 



^'^''^l + Txl^§S^ ' 1 + 2INR3 + SNR3(1 + INR3 



1+SNR2+21NR2 

INR2 \ml 



1 + iJnr^Sr. 1 + 2INR3 + SNR3(1 + INR3) 
Thus, we need to show that 



(102) 
(103) 



INR^ INR2 INR^ 

T I - 

I+SNR2+2INR2 

INR^SNRa SNR2INR3 



1 + 2INR3 + SNR3(1 + INR3) 1 + ^ J^^^S^, 1 + 2INR3 + SNR3(1 + INR3) 

< 0. (104) 
I + SNR2 + 2INR2 + SNR2INR2 I + INR3 - ^ ' 

Note that LHS decreases with SNR3 and hence it is enough to show that above is negative for SNR3 = 0. 
With SNR3 = 0, we find that the above decreases with SNR2 and thus we only need to show for low 
value of SNR2 and as SNR2 > INR3, it is enough to show for INR3 = SNR2. The above reduces to 



INR^(l + INR3 + 2INR2 + INR3lNR2) INR3lNR2(l + INR3 + 2INR2) 2 
(1 + 2INR3)(1 + INR3) ^ I + 2INR3 ^ 

We now show that this decreases with INR2. To see that, differentiate w.r.t. INR2, we get 



' (1 + 2INR3)(1 + INR3) I + 2INR3 ^ ^ 



INR3 



(1 + 2INR3)(1 + INR3) 
INR3(4INR2) - 2INR2(1 + 2INR3) 
I + 2INR3 
INR3 2INR2 



(INR3(2 + INR3) - (I + INR3)') 



(106b) 
(106c) 



(1 + 2INR3)(1 + INR3) I + 2INR3 

^ (l + 2INR3) ^'^'^^^^^ + '^^^^~^'^^^^ ^^^^^^ 

As INR3 < INR2, the above is negative; thus it is enough to prove the above for INR2 = INR3. Thus, it 
is enough to show that 

(1 + 3INR3 + INR^) (I + 3INR3) 
(1 + 2INR3)(1 + INR3) I + 2INR3 - ■ ^ ^ 

which is true. 

To show Ri + R2,p < log(l + INR2 + SNR2/(l + INR3)) , we will show that Ri + R2,p < log(l + INR2 + 

SNR2/(1 + INR3) - (i+snr,+2Inr2+sSr2)(i+inr3) ) which will prove the claim. To see this, it is enough 
to prove that 
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= INR. + SNR./(1 + INR3) - ^ 3^^^ ^ Jnr:Tsn'r' INR.)(1 + INR3) 
This is equivalent to proving 

I + INR2 SNR2 



(I + SNR2 + 2INR2) 



I + 2INR2I + INR3 

IMD CMD ^ I + SNR2 + 2INR2 + SNR2INR2 SNR2 INR2SNR2INR3 

^ 'NR2SNR2 + Y^mR, iTTnr^" (1 + INR3) ^^^^^ 



which is equivalent to 

INR9SNR9INR3 

INR2SNR2/(1 + INR3) = INR2SNR2 - ■■ ,i , ' (110) 

which is true. Hence proved. 

To show R2.C + -R2,p < log(l + SNR2), it is enough to prove that 

I|\|p2 INR^ 

^ fSNR2/(l+INR3)+ — — ^- — — SNR2/(1+INR3) < SNR2 



1 + 2INR3 + SNR3(1 + INR3) ' 1 + 2INR3 + SNR3(1 + INR3 

(111) 

Or, 

+ n , o.Mo ,'cMo , ,Mo ^ SNR2/(l + INR3) < SNR2/(1 + INR3) 



1 + 2INR3 + SNR3(1 + INR3) 1 + 2INR3 + SNR3(1 + INR3) 
Or, 



(112) 

INR3 < SNR2(1 + SNR3) (113) 



which holds since INR3 < SNR2. 
To show Ri + R2,c + R2,p < log(l + INR2 + SNR2), it is sufficient to show that 



INR2 + SNR2/(1 + INR3) + 



I + INR2 INR 



1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 



+(INR2 + SNR2/(1 + INR3) -d) 



I + INR2 INR 



1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 
< INR2 + SNR2, (114a) 

where d = ^,+sNR,JmT-Sii7Nk){i+^msy Note that the above is equivalent to show 

1 I IMD INR^ 

-SNR,INR3/(1 + INR3) - d + ,^,,^l,^,,^^^^sNk(l^mR,) 

1 _|_ IMD INR'^ 

+(INR, + SNRVd + INR3) - '*)T^TT^M^TSNfe(IT^ ^ ° 
Since this decreases with SNR3, it is enough to show for SNR3 = 0. Note that the above is equivalent to 
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show 

. ^ , r. X . 1 + I N R2 I N R3 , (l + INRs)^ I N 

1 + INR2 \ml , 

-'"^+ l + 2INR.l + 2INR3 '^» <"^' 
Since this decreases with SNR2, it is enough to show the above for SNR2 — INR3 as SNR2 > INR3. 

I + INR2 INR3 \ (1 + INR2)^(1 + INR3) 1 



1 + 2INR2 1 + 2INR3y I + 2INR2 I + 2INR3 

iNR^ / 1 + 1NR2 \ml \ 

' 1 + , — <0 (117) 



(1 + INR3 + 2INR2 + INR3INR2) V 1 + 2INR2 1 + 2INR3^ 
Thus, it suffices to show 

A = -(1 + INR3 + 2INR2 + INR3lNR2)(l + 2INR2)(1 + 2INR3) + (1 + INR3 + 2INR2 + INR3INR2) x 

(1 + INR2)(INR3 + (1 + INR2)(1 + INR3)) - INR^((1 + 2INR2)(1 + 2INR3) + (1 + INR2)INR^) < 

Note that 

dA/d{\m2) = -(1 + INR3 + 2INR2 + INR2lNR3)2(l + 2INR3) - (2 + INR3)(1 + 2INR2)(1 + 2INR3) 
+(2 + INR3)(1 + INR2)(INR3 + (1 + INR2)(1 + INR3)) 
+(1 + INR3 + 2INR2 + INR2lNR3)(INR3 + (1 + INR2)(1 + INR3)) 
+(1 + INR3 + 2INR2 + INR2lNR3)(l + INR2)(1 + INR3) 

-2INR2((1 + 2INR2)(1 + 2INR3) + INR^(1 + INR2)) - INR^(2 + 4INR3 + INR^) 
= -INR3(1 + 3INR^ + INR3 + 2INR2(2 + INR3)) < (118a) 

Thus, it is enough to prove A < for INR2 = INR3, and hence we need to show that 

-(1 + 3INR3 + INR^)(1 + 2INR3)2 + (1 + 3INR3 + INR2)2(i + INR3) 

-INR^((1 + 2INR3)2 + (1 + INR3)INR^) < (119) 

which holds since the above reduces to -INR3(1 + 3INR3 + 2INR3) < 0. 

At the receiver 3, i?2,c and -R3 are decoded jointly treating i?2,p as noise. We see that 

R2,c < log(l + INR2/(l + 2INR3)) (120a) 
R?, < log(l + SNR3(1 + INR3)/(1 + 2INR3)) (120b) 
R2,c + R3 < log(l + INR^/(l + 2INR3) + SNR3(l + INR3)/(l + 2INR3)) (120c) 

To show R2,c + R3 < log(l + INR2/(l + 2INR3) + SNR3(l + INR3)/(l + 2INR3)), we see that i?2,c + < 

log(l + l+2iNR3+S(l+INR3) ) + log(l + SNR3(1 + INR3)/(1 + 2INR3)) < log(l + INR^/(l + 2INR3) + 

SNR3(1 + INR3)/(1 + 2INR3)) as was in the case of 2-user channel. 

3) INR2 > SNR2, INR3 > SNR3, INR2 < SNRi, INR3 > SNR2.- The first transmitter makes two 
codebooks, the first one of rate 

(INR^ \ 
^+ l + 2INR2 + SNR2(l + INR2) j' ^'^'^ 

and the second of rate 



i?i,p = log(l + SNRi/(l + INR2)). 



(122) 
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The transmitter send the first one at a power of INR2/(1 + INR2) and the second one at a power of 
1/(1 + INR2), adds them up and transmit. The second transmitter makes a codebook of rate 

\ -r ^ii>iiv2 Y ^ I+SNR3+2INR3 / / 

and uses a power level of 1 to transmit. The third transmitter makes a codebook of rate 

R3 = log(l + SNR3(1 + INR3)/(1 + 2INR3)) (124) 

and uses a power level of 1 to transmit. 

The first receiver is able to decode following the same steps as in two user interference channel given 
above. The second receiver decodes i?i,c and R2 treating i?2,p as noise. The decoding happens when 

, / INR2/(1 + INR2) \ 

«^ ^ '°<i+ i + inr'/(i;inrJ <'^5^> 

^ r. . ( SNR2 + INR^/(1 + INR2)\ 

These conditions can be easily shown to be satisfied. 

The third receiver does a joint decoding and can decode R-^, and the rate constraints are satisfied as in 
2-user since powers remain same as there while rates now are even lesser. 

4) INR2 > SNR2, INR3 > SNR3, INR2 < SNRi, INR3 < SNR2.- The first transmitter makes two 
codebooks, the first one of rate 

(INR^ \ 
'+l + 2INR2 + SNR2(l + INR2)j' ^'^^^ 

and the second of rate 

= log(l + SNRi/(l + INR2)). (127) 

The transmitter send the first one at a power of INR2/(1 + INR2) and the second one at a power of 
1/(1 + INR2), adds them up and transmit. 
The second transmitter makes two codebooks, the first one of rate 

( I + INR2 INR^ \ 

i?2,c - log (^1 + ^^2INR2 1 + 2INR3 + SNR3(1 + INR3) J ' ^ ^ 

and the second of rate 

^2,p = log (1 + i^^2|NR^ SNR2/(l + INR3)) . (129) 

The transmitter send the first one at a power of INR3/(1 + INR3) and the second one at a power of 
1/(1 + INR3), adds them up and transmit. 
The third transmitter makes a codebook of rate 

i?3 = log(l + SNR3(1 + INR3)/(1 + 2INR3)) (130) 

and uses a power level of 1 to transmit. 

As in 2-user case, the first receiver is able to decode the public message treating private as noise and 
then decode the private message. The third receiver can decode the i?2,c and 7?3 jointly treating i?2,p as 
noise which can be shown following the same steps as in 2-user. However, the calculations at the second 
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receiver are different. The second receiver decodes Ri^c, R2,c and i?2,p jointly treating Ri^p as noise. The 
decoding happens when the following are satisfied. 

INR^/(1 + INR2) 



Ri,c < log 



R 



'2,p 



< log 

< log 



Rl,c + R2,c < log 

Ri,c + R2,p 

R2,c + R2,p 
Rl,c + R2,c + R2,p 



< log 

< log 

< log 



1 + 

1 + 

1 + 

1 + 

1 + 

1 + 

1 + 



INR3/(1 + INR3) 



1 + INR2/(1 + INR2) 

SNR2 
1 + INR2/(1 + INR2) 

SNR2/(1 + INR3) 
1 + INR2/(1 + INR2) 
INR^/(1 + INR2) + SNR2lNR3/(l + INR3 

1 + INR2/(1 + INR2) 
INR^/(1 + INR2) + SNR2/(1 + INR3 

1 + INR2/(1 + INR2) 

SNR2 



1 + INR2/(1 + INR2) 
INR^/(1 + INR2) + SNR2 



(131a) 
(131b) 
(131c) 
(131d) 
(131e) 
(131f) 
(131g) 



1 + INR2/(1 + INR2) 

It is straightforward to see that the individual constraints are satisfied. To see the constraint on Ri^c + R2,c, 
we write the satisfying equation and note as in second case that it is enough to use SNR3 — and 
SNR2 = INR3. Thus, it is sufficient to prove that 

1 + INR2 



INR^ 



\ml 



1 + 2INR2 + INR3(1 + INR2) 1 + 2INR2 1 + 2INR3 



1 + INRo 



\mi 



< 



1 + 2INR2 + INR3(1 + INR2) 1 + 2INR2 1 + 2INR3 
I + INR2 f \ml INR^ 



I + 2INR2 VI + INR2 ^ I + INR2 



(132a) 



This is equivalent to 



-1 + 



INR. 



I + 2INR3 
-(1 + 2INR2 + INR3(1 + INR2)) 



INR^ 



INR^(1 + INR3)(1 + 2INR3 



< 



(133) 



The above expression increases with INR3 and hence it is sufficient to show that the above holds as 
INR2 — > 00, at which also, this is < and hence holds. 

To show Ri,c + R2,p < log (1 + '^^^^^\+rR,Ai+^NRir'^'''^ ) ' Sufficient to show that 



INR^ 



1 + 2INR2 + SNR2(1 + INR2) 
This reduces to 

I + INR2 SNR2 



1 + 



I + INR2 SNR2 



< 



I + 2INR2I + INR3 
1 



< 



I + INR2 INR^ 



I + 2INR2I + INR3 - I + 2INR2 



I + 2INR2I + INR2 

(SNR2(1 + INR2)), 



(134) 



(135) 
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which trivially holds. Note that more precisely. 



INR^/(1 + INR2) + SNR2/(1 + INR 



whprp r1 = INRiSNR2lNR3(l+INR2) 

wiicic u (i+|NR3)(l+2INR2)(l+2INR2+SNR2(l+INR2))- 

The result _R2,c + R2,p < log(l + i_|_inr^/(i!|_inr^) ) holds by using the same steps as in the 2-user Z since 
cancelling ]^_^2inr^2 sides, the LHS is now even smaller than was before. 

Lastly, we need to show -Ri,c + -^2,0 + -^2,^ < log ^1 + ' ^ ^^\ur2 '/( ^4-i n R2 ) ' show this, it is sufficient 
to prove that 

( INR^/(1 + INR2) + SNR2/(1 + INR3) 
1 + INR2/(1 + INR2) 

, / 1 + INR2 \ml 

+ log 1 ' ' 



1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 



< log(l+''"^/!?,:''"^^:,^'"M- (136) 



1 + INR2/(1 + INR2 

As before, it is sufficient to prove this for SNR3 = 
Hence, it is sufficient to prove that 



I + INR2 \ml \ I + INR2 INR^ 
-d[l+ ill A 



l + 2INR2l + 2INR3y I + 2INR2I + 2INR3 

2 
3 



^ INR^(1 + INR2) INR^ 



(1 + 2INR2)2 I + 2INR3 
SNR2(1 + INR2)INR3 / I + INR2 INR3 



< (137) 



(1 + INR3)(1 + 2INR2) V I + 2INR2I + 2INR3 

As d increases with SNR2, the above expression decreases with SNR2 and thus it is enough to show that 
above holds for SNR2 = INR3. Thus, 

INR^(1 + INR2) / I + INR2 INR^ 



1 + 



(1 + INR3)(1 + 2INR2)(1 + 2INR2 + INR3(1 + INR2)) V 1 + 2INR2 1 + 2INR3 
^1 + INR2 1 ^ INR^(1 + INR2) 1 



I + 2INR2I + 2INR3 (1 + 2INR2)2 I + 2INR3 
(I + INR2) / I + INR2 INR3 



(1 + INR3)(1 + 2INR2) V 1 + 2INR2 1 + 2INR3 



< (138) 



Or, 



-INR^(1 + INR2)((1 + 2INR2)(1 + 2INR3) + (1 + INR2)INR^) + (1 + INR2)(1 + 2INR2)(1 + INR3) 

x(l + 2INR2 + INR3(1 + INR2)) + INR^(1 + INR2)(1 + INR3)(1 + 2INR2 + INR3(1 + INR2)) 

-(1 + INR2)(1 + 2INR2 + INR3(1 + INR2))(1 + 2INR2 + INR3 + 3INR2INR3) < (139) 

which is true since the above reduces to -INR2(1 + INR2)^INR3(1 + INR2 + INR3) < 

J) INR2 > SNR2, INR3 < SNR3, INR2 > SNRi.- The first transmitter makes a codebook of rate 



INR 



i?i = log I 1 + min I SNRi, , , snr'inr^ 1 1 ' (1^0) 



1+SNR2+21NR2 / 
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and uses a power level of 1 to transmit. 
The second transmitter makes a codebook of rate 



R2 = log (1 + ]^^^SNR2/(1 + INR3) j , (141) 

and uses a power level of 1/(1 + INR3) to transmit. 
The third transmitter makes a codebook of rate 

R3 = log(l + SNR3(1 + INR3)/(1 + 2INR3)) (142) 

and uses a power level of 1 to transmit. 

The first receiver is able to decode as rate < log(l + SNRi). The third user is able to decode treating 
R2 as noise. Further, the second user decodes Ri and R2 jointly. The decoding happens since 

Ri < log(l + INR2) (143a) 
R2 < log(l + SNR2/(1 + INR3)) (143b) 
R1 + R2 < log(l + INR2 + SNR2/(l + INR3)) (143c) 

To show the sum rate, it is enough to prove 

INR2 1 + INR2 ^..^ 

+ -. O.MD SNR2/(1 + INR3) 



IH SNR2INR2 I + 2INR9 

^ ^ I+SNR2+2INR2 ^ ^nMi\2 

INR2 I + INR2 

+ 11 SNR2INR2 1 I o|NR ^^'^^/^^ + ^^^'^ - + ^^^'/^^ + '^^^^ ^^^^"^ 

I+SNR2+2INR2 ^ 

This is equivalent to proving 

(1 + INR2)(1 + SNR2 + 2INR2) 
< INR2(l + 2INR2)(l + INR3) + l + SNR2 + 2INR2 + SNR2lNR2 (145) 

which trivially holds. 

6) INR2 > SNR2, INR3 < SNR3, INR2 < SNRi.- The first transmitter makes two codebooks, the first 
one of rate 

(||\jp2 \ 
^+ l + 2INR2 + SNR2(l + INR2) j' ^'^^^ 

and the second of rate 

i?i,p = log(l + SNRi/(l + INR2)). (147) 

The transmitter send the first one at a power of INR2/(1 + INR2) and the second one at a power of 
1/(1 + INR2), adds them up and transmit. 
The second transmitter makes a codebook of rate 

R2 = log (^1 + ]^^2|NR^ SNR2/(1 + INR3)) , (148) 

and uses a power level of 1/(1 + INR3) to transmit. 
The third transmitter makes a codebook of rate 

Rs = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) (149) 

and uses a power level of 1 to transmit. 

The first decoder can decode in the same way as in 2-user Z, and the third user is able to decode 
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treating R2 as noise. The second receiver can decode Ri^c and R2 jointly treating Ri^p as noise since 

Ri,c < log(l + INR2/(1 + 2INR2)) (150a) 
R2 < log (^1 + SNR2/(1 + INR3) ^^^^INrO ^^^^""^ 
Ri,c + R2 < log (^1 + INR^/(1 + 2INR2) + SNR2/(1 + INR3) j^^^INrO ^^^^""^ 

These conditions are straightforward to verify and are therefore omitted. 

7) INR2 < SNR2, INR3 > SNR3, INR3 > SNRa.- The first transmitter makes a codebook of rate 

i?i = log(l + SNRi/(l + INR2)), (151) 

and uses a power level of 1/(1 + INR2) to transmit. 
The second transmitter makes a codebook of rate 

and uses a power level of 1 to transmit. 

The third transmitter makes a codebook of rate 

i?3 = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) (153) 

and uses a power level of 1 to transmit. 

The 1st receiver and 3rd are able to decode by the same scheme of 2-user. The second receiver can 
decode treating Ri as noise. 

8) INR2 < SNR2, INR3 > SNR3, INR3 < SNR2.- The first transmitter makes a codebook of rate 

i?i = log(l + SNRi/(l + INR2)), (154) 

and uses a power level of 1/(1 + INR2) to transmit. 
The second transmitter makes two codebooks, the first one of rate 

f I + INR2 INR3 \ 

R2,c - log (^1 + 1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) ) ' ^ ^ 

and the second of rate 

R2,p = log (1 + /+2|NR^ SNR2/(1 + INR3)) . (156) 

The transmitter send the first one at a power of INR3/(1 + INR3) and the second one at a power of 

1/(1 + INR3), adds them up and transmit. 
The third transmitter makes a codebook of rate 

Rs = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) (157) 

and uses a power level of 1 to transmit. 

The second receiver treats Ri as noise and the equation for decoding at second receiver follow the 
2-user equations. 

9) INR2 < SNR2 and INR3 < SNRg.- The first transmitter makes a codebook of rate 

i?i = log(l + SNRi/(l + INR2)), (158) 
and uses a power level of 1/(1 + INR2) to transmit. 
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The second transmitter makes a codebook of rate 

= + r^^^^'^^/^^ + '^'^^O ' ^^^^^ 

and uses a power level of 1/(1 + INR3) to transmit. The third transmitter makes a codebook of rate 

i?3 = log (1 + SNR3(1 + INR3)/(1 + 2INR3)) (160) 

and uses a power level of 1 to transmit. 

Here, second user can treat first user's data as noise. Thus, all are able to decode. 

Appendix F 

The sum rate is within 4 bits with 2.5 rounds of message passing 

A. INR2 > SNR2, INR3 > SNR3, INR2 > SNRi, INR3 > SNR2 

Note that we only need to consider the case when sSnr^^inrz — < SNRi and snr%r3 — < SNR2, 

1+SIMR2+21NR2 ■'■"^ 1+SNR3+21NR3 

since otherwise the 1.5 round scheme was within 4 bits of optimal. 

The rates allocated to the users in this case are: 

Ri = min(log(l + SNRi),log(l + INR2 + SNR2) -i?2) (161a) 

i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (161c) 

For log(l + INR2 + SNR2) — -R2 > log(l + SNRi), the sum rate is within 3 bits since Ri is optimal and 
R2+R3 is within 3 bits (2 due to Z, 1 additional due to user 2 backing off). For log(l+INR2+SNR2)— i?2 < 
log(l + SNRi), Ri + R2 is optimal and R^ is within 1 bit. 

B. INR2 > SNR2, INR3 > SNR3, INR2 > SNRi, INR3 < SNR2 
The rates allocated to the users in this case are: 



Ri = min(log(l + SNRi), log(l + INR2) - R2) (162a) 



R2 = log 1 + 



I + INR2 INR 



1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 

+ + 1^+2INR2 ^^^^^^^ + '^^=^0 ^^^^^^ 

R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (162c) 

For log(l + INR2) — -R2 > log (1 + SNRi), the sum rate is within 3 bits since -Ri is optimal and R2 + R?, is 
within 4 bits (2 due to Z, 2 additional due to user 2 backing off). For log(l + INR2) — i?2 < log(l + SNR1), 
Ri + R2 is within 1 bit and R^ is within 1 bit making overall sum rate within 2 bits. 
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C. INR2 > SNR2, INR3 > SNR3, INR2 < SNRi, INR3 > SNR2 
The rates allocated to the users in this case are: 

, / SNRi \ / / INR^ 

Ri = log 1 + + min log 1 + ^ 



l + INR2y V V I + 2INR2 

R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (163c) 

For log (1 + ^) < log (1 + SN..(;..H.H.NRi ) _ j,^. 

R, > log (^1 + SNRi/(l + INR2) (^1 + Y^^'^R^) ) > log(l + SNRi/2). 

Thus, Ri is within 1 bit of optimal and R2 + R3 is within 3 bits thus making overall sum-rate within 4 
bits. . X . 

For log (1 + ^) > log (1 + 5^afl±S^) - 



vv 1 + INR2/ V 1 + 2INR2 

> log((l.SNR.)/(l + INR,)((i±M±^M^ 

> log((l + SNRi)/(l + INR2)((l + INR2 + SNR2)/2)) (164) 

and thus Ri + R2 is within 1 bit of optimal and R3 is within 1 bit of optimal. Thus, the sum rate is within 
2 bits. 

D. INR2 > SNR2, INR3 > SNR3, INR2 < SNRi, INR3 < SNR2 
The rates allocated to the users in this case are: 

R, = log(l + SNR,/(1 + INR2)) + min (log (l + ^ 4nr!T|NR2 

logfl+r-^V^.l (165a) 



i?2 = log 1 + 



1 + 2INR2^ 

'2 iiMr\3 

1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 



1 + INR2 \ml 



+ + /+2INR2 ^^^^^^^ + '^^=^0 ^^^^^^ 

i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (165c) 

If log (1 + i4SSrO < log (1 + I^) - R2, R, = log (1 + 4SSr, ) + log(l + SNRi/(l + 
INR2)) = log(l + SNRi). Thus, the first transmitter transmits at optimal rate and R2 + R3 is within 4 bits. 

If log (1 + i4nr\':L. ) > log (1 + - R, + R2> log(l + SNRi) + log (1 + ^) - 2. 

Thus, the overall sum rate is within 3 bits of optimal. 
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E. INR2 > SNR2, INR3 < SNR3, INR2 > SNRi 
The rates allocated to the users in this case are: 

Ri = min {\og{l + SNRi), log {l + INR2 + J^^^j^) - (166a) 

R2 = log(^l + ^^^tij^SNR2/(l + INR3)) (166b) 
R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (166c) 

If log(l + SNRi) < log (1 + INR2 + ifr!^) -R2, then user 1 transmits at the optimal rate and R2 + R3 
is within 3 bits of optimal sum-rate, thus, we achieve within 3 bits of the sum capacity. 

If log(l + SNRi) > log (1 + INR2 + ^) - R2, Ri + R2> log (1 + INR2 + > log(l + 

INR2) > log(l + INR2 + SNR2) - 1. Thus, Ri + R2 is within 1 bit of optimal and R3 is within 1 bit and 
thus the sum rate is within 2 bits of optimal. 

F. INR2 > SNR2, INR3 < SNR3, INR2 < SNRi 
The rates allocated to the users in this case are: 

i^i = log(l + SNRi/(l + INR2))+min(log(l + ^^f^|^ 

log fl + - R2) (167a) 



I + 2INR2 

R2 - log(^l + i^^SNR2/(l + INR3)^ (167b) 
R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (167c) 

If log (1 + i^nr'+inr. ) ^ (1 + iT^) - Ri = log(l + SNRi) and R2 + Rs is within 3 bits 
of optimal and thus the sum rate is within 3 bits of optimal 

If log (1 + i+s"rO > l°g (1 + I») -^2, Ri + R2 = log (1 + ^) +log(l + SNRi/(l + 
INR2)) which is within 2 bits of optimal as in case D. Thus, the overall sum rate is within 3 bits of 
optimal. 

G. INR2 < SNR2, INR3 > SNR3, INR2 > SNRi, INR3 > SNR2 
The rates allocated to the users in this case are: 

Ri = min(log(l + SNRi),log(l + INR2 + SNR2) - i?2) (168a) 

Rs = log(l + SNR3(l + INR3)/(l + 2INR3)) (168c) 

If log(l + SNRi) < log(l + INR2 + SNR2) - R2, the first user sends at optimal rate and R2 + R3 is 
within 3 bits of optimal and thus the sum capacity is within 3 bits of optimal. 

If log(l + SNRi) > log(l + INR2 + SNR2) - i?2, ^1 + ^2 is optimal and thus the sum capacity is within 
1 bit of optimal. 
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H. INR2 < SNR2, INR3 > SNR3, INR2 < SNRi, INR3 > SNR2 
The rates allocated to the users in this case are: 




SNRi \ , . ( ^ , INR^ 



+ min I log I 1 + 




, , I + INR2 INR^ 

i?2 = log 1+ 3 



1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 



(169a) 



I + INR2 

INR^ + SNR2(1 + INR2) 
I + 2INR2 

^ ^ I SNR2, ^'p" IP 1 1 (169b) 

I + 2INR2 '1+ J^Sr3 

i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (169c) 

If log (1 + ^) < log (1 + ^^»«^) - i?. then 

Thus, the sum rate is within 4 bits of the optimal. 

If loS (1 + I^) > log (1 + - R., 

, / INRi + SNR2(l + INR2)\ / SNRi 

> log(l + SNR2/2) + log(l + SNRi) - log(l + INR2) 

> log(l + SNR2 + INR2) -2 + log(l + SNRi) -log(l + INR2). (170) 
Thus, the sum rate is within 3 bits of the optimal. 

/. INR2 < SNR2, INR3 > SNR3, INR2 > SNRi, INR3 < SNR2 
The rates allocated to the users in this case are: 

i?i = min (\og{\ + SNRi), log (^1 + INR2 + ^^^^j^) - ^2,c, 

log(^l + INR2 + ^^^|^^ -i?2,p,log(l + INR2 + SNR2)-i?2) (171a) 



+ + /+2INR2 ^^'^^^^^ + '^'^'O ^^^^""^ 

i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (171c) 

We consider the four cases when the corresponding term in Rx is the minimum: 

1) Minimum is log(l + SNRi): R\ is optimal and R2 + -R3 is within 4 bits. 

2) Minimum is log (l + INR2 + ^^S^) " ^2,0: 

ft + fl. > log (1 + INR, + ») + log (1 + i^SNR,/(l + ,NR3)) 
> log(l + INR2 + SNR2) - 1. 
Thus, i?i + i?2 and i?3 are both within 1 bit resulting in sum capacity within 2 bits. 
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3) Minimum is log (l + INR2 + - R2,p: 



= log I + INR2 + - — +log 1 + 



SNR2 \ , A I + INR2 INR: 



l + INRay/ l + 2INR2l + 2INR3 + SNR3(l + INR3) 

+ log(l + SNR3(1 + INR3)/(1 + 2INR3)) (172) 

> log (^1 + INR2 + Y^^) + log(l + INR3 + SNR3) - 2. (173) 

The first term log (l + INR2 + ifn^) > log(l + INR2) and also > log(l + INR2 + SNR2) -log(l + 
INR3). Thus, the above is within 2 bits of optimal sum capacity. 
4) Minimum is log(l + INR2 + SNR2) — R2'- Ri + R2 is optimal and R3 is within 1 bit of optimal; 
thus the sum capacity is achieved within 1 bit. 



J. INR2 < SNR2, INR3 > SNR3, INR2 < SNRi, INR3 < SNR2 
The rates allocated to the users in this case are: 

= log IH — ^ + mm log 1 ' ^ 



l + INR2y V V I + 2INR5 

, . , INR2 + SNR2lNR3(l + INR2)/(l + INR3)\ ^ 

I ^ + TTmiR^ ) ~ 

INR2 + SNR2(1 + INR2)/(1 + INR3)\ ^ 



R2 = log 



I + INR2 INR^ 

1 + 2INR2 1 + 2INR3 + SNR3(1 + INR3) 



+ + /+2INR2 ^^^^^^^ + '^^'0 ^^^"^^^ 

R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (174c) 

We consider the four cases when the corresponding term in Ri c is the minimum: 

1) Minimum is log (l + : i?i = log (l + +log (l + ^^^j^) is within 1 bit of log(l + 

SNRi) (will be shown in Appendix F.N). Moreover, i?2,c+^3 is within 2 bits of log(l + INR3+SNR3) 
(shown in Appendix F.l). i?2,p is within 1 bit of log(l + SNR2) - log(l + INR3). Thus, R2 + R3 is 
within 3 bits of optimal and Ri within 1 bit of optimal; thus the overall sum rate is within 4 bits 
of optimal. 

2) Minimum is log (l + 'NRi+SNR.INR3(lH-INR.)/(l+INR3) -| _ ^^^^^ 

INR^ + SNR2(1 + INR2)' 



i?l c + i?2 > log 1 + 

^ 1 + 21NR2 

This sum rate will be larger than in the fourth case, and is therefore omitted here. 
3) Minimum is log (l + iNRi+SNR.a+iNR.)/(i+iNR3) -j ^^-^ ^^^^^ R,+R,+R, = log (l + ^ 

W J- INRi+SNR2(l+INR2)/(l+INR3) \ , f l+INR^ INRg \ i c:mr /i , 

+ I+2INR2 ) + + I+2INR2 1+2INR3+SNR,,(1+INR3) ) + ^^Sl^ + ^l^^^sU + 

INR3)/(1 + 2INR3)) > log (1 + ^) + log (1 + + + '^'^^ + 
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SNRg) - 2. 
. If INR3(INR2 + 1) > SNR2: 



, , SNRi \ , / , INR^ + SNR2(1 + INR2)/(1 + INR3) 

log 1 + -. , ,MD + log 1 + 



l + INR2y °V I + 2INR2 

, , SNRi \ , / INR^ 

> log 1 + z + log 1 + ^ 



1 + INRsy ° V I + 2INR2 
> log(l + SNRi) - 1. 

Thus, the rate within 3 bits of sum capacity can be achieved. 

. If INR3(INR2 + 1) < SNR2: log (1 + ^) + log (1 + INRi+SNR.a+INR.)/(l+INR3) j > j^^^^ ^ 

SNRi) - log(l + INR2) + log(l + INR2 + SNR2/(1 + INR3)) - 1 > log(l + SNRi) - log(l + 
INR2) + log(l + INR2 + SNR2) - log(l + INR3) - 1. Thus, the rate within 3 bits of sum capacity 
can be achieved. 

4) Minimum is log (l + "^'^^^f+2?NR^'^^'^ ) " ^2: In this case, i?i + R2 is within 1 bit of optimal 

(log(l + SNRi) + log ^1 + ifjj^j) and i?3 is also within 1 bit of optimal; thus sum capacity within 
2 bits can be achievea. 

K. INR2 < SNR2, INR3 < SNR3, INR2 > SNRi, INR3 > SNR2 
The rates allocated to the users in this case are: 

Ri = min (^log(l + SNRi), log (^1 + INR2 + ^^^[^^ - 7^2) (175a) 

R2 = log(^l + ^^^ij^SNR2/(l + INR3)) (175b) 
i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (175c) 

If log(l + SNRi) < log (l + INR2 + ifn^) - R2, R2 + is within 3 bits of optimal and thus the 
sum rate within 3 bits can be achieved. 

If log(l + SNRi) > log (1 + INR2 + - R2, Ri + R2 + R3> log (l + INR2 + +Rs> 

log(l + SNRi) + i?3 > log(l + SNRi)+log(l + SNR3)-l > log(l + SNRi)+log(l + SNR3 + INR3) -2. 
Thus, sum capacity within 2 bits can be achieved. 

L. INR2 < SNR2, INR3 < SNR3, INR2 > SNRi, INR3 < SNR2 

Note that we will focus on SNR2 < INR3(INR2 + 1), since for SNR2 > INR3(INR2 + 1), one and a half 
round was already within 4 bits. 

The rates allocated to the users in this case are: 

Ri = min (^log(l + SNRi), log (^1 + INR2 + ^-^^[^^ -i?2^ (176a) 

R2 = log(^l + ^^^ij^SNR2/(l + INR3)^ (176b) 
R3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (176c) 

If log(l + SNRi) < log (^1 + INR2 + ifn^) - R2, R2 + R3 is within 3 bits of optimal and thus the 
sum rate within 3 bits can be achieved. 

If log(l + SNRi) > log (1 + INR2 + - R2, Ri + R2 + R3> log (1 + INR2 + + i?3 > 

log(l + INR2) + i?3 > log(l + INR2) + log(l + SNR3) - 1 > log(l + INR2) + log(l + SNR3 + INR3) - 2. 
Thus, sum capacity within 2 bits can be achieved. 
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M. INR2 < SNR2, INR3 < SNR3, INR2 < SNRi, INR3 > SNR2 
The rates allocated to the users in this case are: 

, / SNRi \ / / INR^ 

Ri = log 1 + + min log 1 + ^ 



l + INRs/ V V I + 2INR2 

R2 = log(^l + ^^^til^SNR2/(l + INR3)^ (177b) 
i?3 = log(l + SNR3(l + INR3)/(l + 2INR3)) (177c) 

If log (1 + ^) < log (1 + 'NRi.SNR.a.,.R.) ^^ _ ^ (1 + lim^ (1 + ri^)) > 

log(l + SNRi) — 1. Thus, the sum rate is within 4 bits of the optimal. 

If log (1 + ^) > log (1 + ^^™™^) - R. + R.^ log (1 + ^™™) + 
log (1+1^) > log(l + SNRO + log (i+ 'NRi+SNMi+iNR.) -| _iog(i + |NR,) > log(l + SNRO. Thus, 
R1 + R2 + R3 > log(l + SNRi) + log(l + SNR3) > log(l + SNRi) + log(l + SNR3 + INR3) - 1. Thus, 
the sum rate within 1 bit can be achieved. 



A^. INR2 < SNR2, INR3 < SNR3, INR2 < SNRi, INR3 < SNR2 

Note that we will focus on SNR2 < INR3(INR2 + 1), since for SNR2 > INR3(INR2 + 1), one and a half 
round was already within 4 bits. 

The rates allocated to the users in this case are: 

, / SNRi \ . A A INR^ 

i?i = log 1 + + mm log 1 + ^ 



1 + INR2y V V 1 +2INR2 

fl2 = log (^1 + i^t|^SNR,/(l + INR3)) (178b) 
Rs = log(l+SNR3(l + INR3)/(l+2INR3)) (178c) 

If log (i + iSk) 2 log (i + ""'r,xr" ) - fl.. tt.™ 

Thus, the sum rate is within 4 bits of the optimal. 
If log (l + I^) > l°g (l + '""'~'""-' ) - 



1 + 21NR2 J V 1 + INR2 

2 



> log(l + SNRO + log (1 + ^?i±!^M™)) _ 10,(1 + ,NR,) 

> log(l + SNRi). 

Thus, i?i + i?2 + i?3 > log(l + SNRi) + log(l + SNR3) > log(l + SNRi) + log(l + SNR3 + INR3) - 1. 
Thus, the sum rate within 1 bit can be achieved. 
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